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Sd. °' P^^^^'""- '^"9th cDNAs encoding the proteins are pro- 

^rP.I^^n ^'f!'"' ^""^ Polynucleotides encoding these proteins have been isolated. The proteins of the 
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Description 

FIELD OF THE INVENTION 



BACKGROUND OF THE INVENTION 

[0002] Currently,sequencingprojec.s,lhedetermlnatk)nandanalysisofihegenomicDNAofvariouslivm^ 
are m progress all over the world. T1,e whole genomic sequences or more than 10 species of prokaJi a"r 
eukaryote, yeas, and a multicellular eukaryole. C. elegans have l,een already detem,ined. As to'he humlTgenoT 
wh>ch ,s supposed to be composed of three thousand million base pairs, w"nd wide cooperative prS aS und"; 

detenT„nat.on of genom-c sequence is to reveal the functions of all genes and their regulation and to understand Hv no 
organ,sms as a network of interactions between genes, proteins, cells or individuals through deduc^ig theTnt™^^^^^ 

bvuZo.het:„rT'n'?'''""'°'^ 

by utteng he genomic .ntormat,on from various species is not only important as 

s-gn,fjcant from the viewpoint of industrial application. However, determination of genomic sequences itseS (^^not 
■denhty the tunctrons of all genes. For example, lor yeast, the function of only approximately haToMhe 6000 JnS 

t^f^J. .^ "^rT y one hundred thousand. Therefore, it is desirable to establish "a high throughputln^ljsfe 

n^tl ,K '""'""^ '^'^^ ^'^^ ^ identify rapidly and efficiently the functions of vasramounts of the genes 
obtained by the genomic sequencing. "u ua ui u ib yenes, 

n^K,!!!^"! '^^ ^"""^ '""'«P'^ exons. TTius. it is difficult to predict 

correct^ the structure of encoded proteins solely based on genomic infomiation. In contrast, cDNA which is prSd 

T ' "«>^e million ESTs (ExprS 

B^ Z humanTen^ """"^ '^-'^^-)- ^^Ts presumably cover not iSL 

!S JTJ'""^'^" """^^ ^"^'y"'"^ '"^"^"'^ °f •^""'3" genome, or for predicting the exon- 

regions of genomic sequences or their expression profile. However, many human ESTs have been derived from orox- 

S ."nS r..T r r'^r*"^ """""^ ^^^^^ """''""^ ^^^^^'^ reduced is approximT; 

7000. and further, the number of full-length clones is ony 5500. Thus, even including cDNA registered as EST the 
percentage of human cDNA obtained so far is estimated to be 10-15% of all the genes '^^'^'^'^'^ ^^T. the 
[00051 It B possible to identify the transcription start site of mRNA on the genomic sequence based on the ff-end 
sequenceofaull-length CDNA, andtoanavzefactors involved in the stability Of mRNA that is co^^^^ 

slrt tlT^T: """T'"" ^'^9^- ^'"'^ ^ '""-'-S'^ '^DNA contains ATG^me trans^ton 

start site, in he 5'-reg,on, .t can be translated into a protein in a correct frame. Therefore, it is possible to produce a 

large amoun Of theprotein encoded by the CDNA or to analyze biological activity Of the e>^es^%o^^^^^^ 

the information from genome sequencing. Also, full-length cDNA clones that can be expressed are extremely vaSle 
in empincal analysis of gene function and in industrial application extremely valuable 

[00061 In particular, human secretory proteins or membrane proteins would be useful by itself as a medicine like 
^^Z^T'""^" ^^f'*)' °' ^« ^ '^'9^> °' '"edicines like membrane receptors 

[00071 -nierefore. it has great significance to isolate novel full-length cDNA clones of humans, of which onh, a few 

I're^r T"^'"''- °' ' •^'^^ "^'""^ '"''^'"^ ^ ^^^'o-y P™'e'" 0^ membrane pttl J 

desired since the protein itself, or a molecule that interacts with the membrane protein wouW be useful as a LdSe 

clon^^n^rt '"'"T"" f " ""^ Wentification of the tul JengTcDNA 

clones encoding those proteins has great significance. 

. SUMf^ARY OF THE INVENTION 

[OOM] An objecth/e of the present invention is to provide a polynucleotide encoding a novel protein, a protein encoded 
by said polynucleotide, and novel usages of these. c ..ajjioiemencoaea 

\T!rr TT'^. ^'.fr ''^^«'°P«'^ a method for efficiently cloning a human full-length cDNA that is predicted by 
theATGprelc.tobeafull-leng,hcDNAclone.fromaf«ll-length-enrichedcDNAIIbra,ythatissynlhesizedb^^^^^^^^^ 
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capping method (K. Mamyama and S. Sugano. Gene. 138: 171-174 (1994); Y. Suzuki ei al. Gene 200 149-156 
(1997)J. Then, the inventors determined the nucleotide sequence of the obtained cDNA dones from both 5'- and 3: 
ends. By utilizing the sequences, the inventors selected clones that were expected to contain a signal by the PSORT 
(Nakai K. and Kanehisa M. (1992) Genomics 14: 897-911). and obtained clones that contain a cDNA encoding a 
secretoty protein or membrane protein. The inventors found that it is possible to synthesize a novel full-lenqth cDNA 
by using the combination of a primer that is designed based on the nucleotide sequence of the 5'-ends of the selected 
ful-lengthcDNAdonesandanyofanoligo-drprimerora3--primerth 
01 the 3 -ends of the selected clones. 

[0O1OI TTie full-length cDNA clones of the present invention have high fullness ratio since these were obtained by 
the combination of (1) construction of a full-length-enriched cDNA library that is synthesized by the oligo-cappina 
method, and (2) a system in which fullness ratio is evaluated from the nucleotide sequence of the S'-end 
[OOl l] Furthemiore. the Inventors have analyzed the nucleotide sequence of the fuO-length cDNA clones obtained 
by the method, and deduced the amino acid sequence encoded by the nucleotide sequence. Then, the inventois have 
^llT^V ^"^^e^fc*' (AltschulS.F, Gish W.. Miller W.. Myers E.W.. and Upman D.J. (1990) J. Mol Biol 215- 
403-410; Gish W.. and States D.J. (1993) Nature Genet. 3: 266-272; httpVAvww.ncbi.nlm.nih.gov/BLAST/) of the Gen- 
Bank {http7/www.ncbl.nlm.nih.gov/Web/GenBank^ndex.html) and SwissProt (http://www.ebi.ac.uk/ebidocs/ 
swissprot_db/swisshome.html) using the deduced amino add sequence to accomplish the present invention " 
[00121 Homology analysis in which the analysis is carried out against a non-full-length cDNA fragment to postulate 
the function of a protein encoded by said fragment, is being commonV performed. However, since such analysis is 
based on the infomriation of the fragment, it is not dear as to whether this fragment corresponds to a part that is 
functionally imponant in the protein. In other words, the reliability of the homology analysis based on the infomiation 
of a fragment is doubtful, as infonmation relating to the structure of the i^o\e protein is not available However the 
homology analysis of the present invention is conducted based on the information of a full-length cDNA comprising 
the whole coding region of the cDNA. and therefore, the homology of various portions of the protein can be analyzed 
Hence, the reliability of the homology analysis has been dramatically improved in the present invention 
[001 3] The present invention relates to the polynucleotide mentioned below, a protein encoded by the polynucleotide 
and their usage. # k / ^- -uc. 

[0014] First, the present invention relates to 

(1) an isolated polynudeotide selected from the group consisting of 

(a) a polynudeoUde comprising a coding region of the nucleotide sequence set forth in any one of the SEQ ID 

NOs \n Table l ; 

(b) a polynucleotide comprising a nucleotide sequence encoding a protein comprising the amino add sequence 
set forth in any one of the SEQ ID NOs in Table 1 ; h 

(0 a polynudeotide comprising a nucleotide sequence encoding a protein comprising an amino acid sequence 
selected from the amino acid sequences set forth in the SEQ ID NOs in Table 1 . in whidi one or more amino adds 
are substituted, deleted, inserted, and/or added, wherein said protein is functionally equivalent to the protein com- 
preing said ammo acid sequence selected from the amino add sequences set forth in the SEQ ID NOs in Table 1 • 

(d) a polynucleotide that hybridizes with a poVnudeotide comprising a nucleotide sequence selected from the 
nudeohde sequences set forth in the SEQ ID NOs in Table 1 , and that comprises a nucleotide sequence encoding 
a protein funrtionally equivalent to the protein encoded by the nudeotide sequence selected from the nudeotide 
sequences set forth in the SEQ ID NOs in Table 1 ; 

(e) a polynucleotide comprising a nucleotide sequence encoding a partial amino add sequence of a protein en- 
coded by the polynucleotide of (a) to (d); 

(0 a polynudeotide comprising a nucleotide sequence with at least 70% identity to the nudeotide sequence set 
forth in any one of the SEQ ID NOs in Table 1 . 

f°°^!.Lrf?'® ^ "^""^ °' '"^ "^^'^ examples described later, comprising the full- 

length CDNA of the present invention, the conresponding SEQ ID NOs. of the nucleoUde sequences of the cDNA clones 
and the corresponding SEQ ID NOs. of the amino acid sequences deduced from the d3NA nudeotide sequences. ' 

Table 1 



Amino acid sequence 


Nucleotide sequence 


Clone Name 


SEQ ID NO: 2 
SEQ ID NO: 4 
SEQ ID NO: 6 


SEQ ID NO: 1 
SEQ ID NO: 3 
SEQ ID NO: 5 


PSEC0001 
nnnnnnnn 
PSEC0005 
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Table 1 (continued) 



25 



Amino acid sequena 


i Nucleotide sequence 




£>EQ ID NO: 8 


SEQ ID NO: 7 


rocOOOOZ 


SEQ ID NO: 10 


SEQ ID NO; 9 




SEQ ID NO: 12 


SEQ ID NO: 11 


DC C/^nn-i o 


SEQ ID NO; 14 


SEQ ID NO; 13 


"Otl^UUI / 


SEQ ID NO: 16 


SEQ ID NO- IS 


r'otO0019 


SEQ ID NO: 18 


SEQ ID NO- 17 


P0ECO020 


SEQ ID NO: 20 


SEO ID NO- 10 


PbEC0021 


SEQ ID NO: 22 


55FO in Nin- Oi 


PSEG0028 


SEQ ID NO: 24 




PSEC0029 


SEQ ID NO: 26 


SEO in NO- 9*; 


PSEC0030 


SEQ ID NO: 28 




PSEC0031 


SEQ ID NO: 30 


SFn in MO- OQ 


PSEC0035 


SEQ ID NO: 32 




PSEC0038 


SEQ ID NO: 34 




PSEC0040 


SEQ ID NO: 36 


55Fn in Kin- r»c 


PSEC0041 


SEQ ID NO: 38 


SFO in wn- 


PSEC0045 


SEQ ID NO: 40 


^P/^ in Kir\- Oft 

OXZ\J lU viKJ. o9 


PSEC0048 


SEQ ID NO: 42 


^FO m M/*^- AA 


PSEC0D49 


SEQ ID NO: 44 


ocVJ lU INU. 4o 


PSEC0051 


SEQ ID NO: 46 


otW lU NU: 45 


PSEC0052 


SEQ ID NO: 48 


QPO ir\ M/^- A-r 
otU lU fNU: 47 


PSEC0053 


SEQ ID NO: 50 




PSEC0055 


SEQ ID NO: 52 


otvj (U NU: 51 


PSEC0059 


SEQ ID NO: 54 


ocVJ lU ViKJ. 53 


PSEC0061 


SEQ ID NO; 56 




PSEC0068 


SEQ ID NO: 58 


QPO in MO- e;7 


PSEC0070 


SEQ ID NO: 60 


QPO in M/^- cn 


PSEC0071 


SEQ ID NO: 62 


*^FO in hir\' CI 


PSEC0072 


SEQ ID NO: 64 


omj lu iMU. Do 


PSEC0073 


SEQ ID NO: 66 


*5PO in MO- nc 
OCVJ lU ViKJ. DD 


PSEG0074 


SEQ ID NO: 68 


*^FO in MO- 


PSEG0075 


SEQ ID NO: 70 


SEO ID NO- RQ 


PSEC0076 


SEQ ID NO: 72 


CpO in MO- 7i 


PSEC0077 


SEQ ID NO: 74 


SEQ ID NO- 


rb hO0079 


SEQ ID NO: 76 


Qpn jn MO- 


PSECOO8O 


SEQ ID NO: 78 


SFO in wo- TV 


PSEC0081 


SEQ ID NO: 80 


SEQ ID NO: 79 


to 0082 


SEQ ID NO: 82 


SEO ID NO- ni 


rb bu0085 


SEQ ID NO; 84 


SEO ID NO- fv\ 


PbEC0086 


SEQ ID NO: 86 


SEQ ID NO: 85 


Kt>tO0087 


SEQ ID NO: 88 


SEO ID NO* ft7 


r5>tU0088 


SEQ ID NO; 90 


SEO ID NO- ftQ 


PotCr0090 


SEQ ID NO; 92 


SEQ ID NO; 91 


PSEC0094 


SEQ ID NO: 94 


SEQ ID NO; 93 


PSEC0095 


SEQ ID NO; 96 


SEQ ID NO: 95 


PSEC0098 


SEQ ID NO; 98 


SEQ ID NO; 97 


PSEC0099 


SEQ ID NO: 100 


SEQ ID NO: 99 I 


PSEC0100 


SEQ ID NO: 102 i 


SEQ ID NO: 101 1 


^SECOIOI 


SEQ ID NO: 104 « 


SEQ ID NO: 103 \ 


^SEC0104 


SEQ ID NO; 106 ; 


SEQ ID NO: 105 | PSEC0105 
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Table 1 (contin ued) 



Amino add sequence 


i Nucleotide sequence 


J Clone Name 


ybUlDNO:108 


i>EQ ID NO: 107 


PSEC0106 


SEQ ID NO: 110 


SEQ ID NO: 109 


PSECOI 07 


SEQ ID NO: 112 


SEQ ID NO: 111 


PSECOloa 


SEQ ID NO: 114 


SEQ ID NO: 113 


PSECOI OQ 


SEQ ID NO: 116 


SEQ ID NO: 115 


P55FC011O 


SEQ ID NO: 118 


SEQ ID NO: 117 


PSEC0111 


SEQ ID NO: 120 


SEQ ID NO: 119 


i OCOv 1 1 ^ 


SEQ ID NO: 122 


SEQ ID NO: 121 




SEQ ID NO: 124 


SEQ ID NO: 123 


PRFPniiQ 


SEQ ID NO: 126 


SEQ ID NO: 125 


rOCVjfU i 


SEQ ID NO: 128 


SEQ ID NO: 127 




SEQ ID NO: 130 


SEQ ID NO: 129 




SEQ ID NO: 132 


SEQ ID NO: 131 




SEQ ID NO: 134 


SEQ ID NO: 133 




SEQ ID NO: 136 


SEQ ID NO: 135 


PQpr'm 07 


SEQ ID NO: 138 


SEQ ID NO: 137 




SEQ ID NO: 140 


SEQ ID NO: 139 




SEQ \0 NO: 142 


SEQ ID NO: 141 




SEQ ID NO: 144 


SEQ ID NO: 143 




SEQ ID NO: 146 


SEQ ID NO: 145 


PSECOI 33 


SEQ ID NO: 148 


SEQ ID NO: 147 


PSECOI 34 


SEQ ID NO: 150 


SEQ ID NO: 1 49 


rotoUloo 


SEQ ID NO: 152 


SEQ ID NO: 151 


rotLrUl oo 


SEQ ID NO: 154 


SEQ ID NO: 1 53 




SEQ ID NO: 156 


SEQ ID NO: 155 




SEQ ID NO: 158 


SEQ ID NO: 157 




SEQ ID NO: 160 


SEQ ID NO: 159 




SEQ ID NO: 162 


SEQ ID NO: 161 


nnnnnnnn 


SEQ ID NO: 164 


SEQ ID NO: 163 


PQPPni A'T 


SEQ ID NO: 166 


SEQ ID NO: 165 


PQPPn-t AQ 


SEQ ID NO: 168 


SEQ ID NO: 167 


P*^ppni c^n 


SEQ ID NO: 170 


SEQ ID NO: 169 


P^FPfH fil 
» ocv>u 1 O I 


SEQ ID NO: 172 


SEQ ID NO: 171 


P*^ FPni i^o 


SEQ ID NO: 174 


SEQ ID NO: 173 




SEQ ID NO: 176 


SEQ ID NO: 175 




SEQ ID NO: 178 


SEQ ID NO: 177 


pRFnm fii 


SEQ ID NO: 180 


SEQ ID NO: 179 


psFnoi fii> 


SEQ ID NO: 182 


SEQ ID NO: 181 




SEQ ID NO: 184 


SEQ ID NO: 183 


PSECOI fi4 


SEQ ID NO: 116 


SEQ ID NO: 185 


PSECOI fiS 


SEQ ID NO: 188 


SEQ ID NO: 187 


PSECOI 67 


SEQ ID NO: 190 


SEQ ID NO: 189 


PSECOI 68 


SEQ ID NO: 192 


SEQ ID NO: 191 


PSECOI 69 


SEQ ID NO: 194 


SEQ ID NO: 193 


PSECOI 70 


SEQ ID NO:. 196 


SEQ ID NO: 195 


PSECOI 71 


SEQ ID NO: 198 


SEQ ID NO; 197 


PSEC0172 


SEQ ID NO: 200 


SEQ ID NO: 199 


PSECOI 73 


SEQ ID NO: 202 


SEQ ID NO: 201 


PSECOI 78 


SEQ ID NO: 204 


SEQ ID NO: 203 


PSECOI 61 
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Table 1 (continued) 



Amino acid sequence 



SEQ ID NO; 206 
SEQ ID NO: 208 
SEQ ID NO: 210 
SEQ ID NO: 212 
SEQ ID NO: 214 
SEQ ID NO: 216 
SEQ ID NO: 218 
SEQ ID NO: 220 
SEQ ID NO: 222 
SEQ ID NO: 224 
SEQ ID NO: 226 
SEQ ID NO: 228 
SEQ ID NO : 230 
SEQ ID NO: 232 
SEQ ID NO: 234 
SEQ ID NO: 236 
SEQ ID NO: 238 
SEQ ID NO: 240 
SEQ ID NO: 242 
SEQ ID NO : 244 
SEQ ID NO: 246 
SEQ ID NO: 248 
SEQ ID NO: 250 
SEQ ID NO: 252 
SEQ ID NO: 254 
SEQ ID NO: 256 
SEQ ID NO: 258 
SEQ ID NO: 260 
SEQ ID NO: 262 
SEQ ID NO: 264 
SEQ ID NO: 266 
SEQ ID NO: 268 
SEQ ID NO: 270 
SEQ ID NO: 272 
SEQ ID NO: 274 
SEQ ID NO: 276 
SEQ ID NO: 278 
SEQ ID NO: 280 
SEQ ID NO: 282 
SEQ ID NO: 284 
SEQ ID NO: 286 
SEQ ID NO: 288 
SEQ ID NO: 290 
SEQ ID NO: 292 
SEQ ID NO: 294 
SEQ ID NO: 296 
SEQ ID NO: 298 
SEQ ID NO: 300 
SEQ ID NO: 302 
SEQ ID NO: 304 



Nucleotide sequence 



SEO ID NO: 205 
SEQ ID NO: 207 
SEQ ID NO: 209 
SEO ID NO: 211 
SEQ ID NO: 213 
SEQ ID NO: 215 
SEQ ID NO: 217 
SEQ ID NO: 219 
SEQ ID NO: 221 
SEQ ID NO: 223 
SEQ ID NO: 225 
SEQ ID NO: 227 
SEQ ID NO: 229 
SEQ ID NO: 231 
SEQ ID NO: 233 
SEQ ID NO: 235 
SEQ ID NO; 237 
SEQ ID NO: 239 
SEQ ID NO; 241 
SEQ ID NO: 243 
SEQ ID NO: 245 
SEQ ID NO: 247 
SEQ ID NO: 249 
SEQ ID NO: 251 
SEQ ID NO: 253 
SEQ ID NO: 255 
SEQ ID NO: 257 
SEQ ID NO: 259 
SEQ ID NO: 261 
SEQ ID NO: 263 
SEQ ID NO: 265 
SEQ ID NO: 267 
SEQ ID NO: 269 
SEQ ID NO: 271 
SEQ ID NO: 273 
SEQ ID NO: 275 
SEQ ID NO: 277 
SEQ ID NO: 279 
SEQ ID NO: 281 
SEQ ID NO: 283 
SEQ ID NO: 285 
SEQ ID NO: 287 
SEQ ID NO: 289 
SEQ ID NO: 291 
SEO ID NO: 293 
SEQ ID NO: 295 
SEQ ID NO: 297 
SEQ ID NO: 299 
SEQ ID NO: 301 
SEQ ID NO: 303 



Clone Name 



PSEC0182 
PSEC0183 
PSEC0190 
PSEC0191 
PSEC0192 
PSEC0197 
PSEC0198 
PSEC0199 
PSEC0200 
PSEC0203 
PSEC0204 
PSEC0205 
PSEC0207 
PSEC0209 
PSEC0210 
PSEC0213 
PSEC0214 
PSEC0215 
PSEC0216 
PSEC0218 
PSEC0220 
PSEC0222 
PSEC0223 
PSEC0224 
PSEC0226 
PSEC0227 
PSEC0228 
PSEC0230 
PSEC0232 
PSEG0233 
PSEC0235 
PSEC0236 
PSEG0240 
PSEC0241 
PSEC0243 
PSEC0244 
PSEC0245 
PSEC0246 
PSEC0247 
PSEC0248 
PSEC0249 
PSEC0250 
PSEC0252 
PSEC0253 
PSEG0255 
PSEC0258 
PSEC0259 
PSEC0260 
PSEC0261 
PSEC0263 
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Table 1 (continued) 





Amino acid sequence 


Nudeoyde <5fHiiiPnr*» 


^✓lone Name 




SEQ ID NO: 306 


5iFn in wn- 


rbbO0027 


5 


SEQ ID NO: 308 




PSEC0047 




SEQ ID NO: 310 




PSEC0066 




SEQ ID NO: 312 


OCW IL* oil 


nnnnnnnn 


10 


SEQ ID NO: 314 


cipo in MO- '511 

OCU lU nJU. Olo 


PSEC0059 




SEQ ID NO: 316 


ot=u lU iNU. olo 


PSEC0092 




SEQ ID NO: 318 


otw (U (MU. ol7 


PSEC0103 




SEQ ID NO- 3?0 


otw lU NU: ol9 


PSEC0117 




SEQ ID NO: 322 


otU lU riU: 321 


PSEC0142 


15 


SEQ ID NO- 


obU ID NO: 323 


PSEC0212 




SEQ ID NO- '^PR 


obU \u NO: 325 


PSEC0239 




SEQ ID NO- '\9n 


ceo IP* M/^. OOT 

obU ID NO: 327 


PSEC0242 






obQ ID NO: 329 


PSEC0251 


20 


SEQ ID NO: 332 


SEQ ID NO: 331 


PSEC0256 


SEQ ID NO; 334 


SEQ ID NO: 333 


PSEC0195 




SEQ ID NO: 336 


SEQ ID NO:.335 


PSEC0206 




SEQ ID NO: 342 


SEQ ID NO: 341 


PSEC0078 




SEQ ID NO: 344 


SEQ ID NO: 343 


PSEC0084 


25 


SEQ ID NO: 346 


SEQ ID NO: 345 


PSEC0237 




SEQ ID NO: 348 


SEQ ID NO: 347 


PSEC0264 




SEQ ID NO: 350 


SEQ ID NO: 349 


PSEC0265 



(2) A substantially pure protein encoded by the polynucleotide of (1 ) 

^ fj** ^ « primer for synthesizing the polynucleotide comprising the nucleotide sequence 

. ^"if °' '° ^'"-^ "'^ complementary strand thereof, wherein said ohSS 
« c^plementary to sa,d polynucleotide or the complementary strand thereof and comprises at lo^nsZSl 

(4) Aprimer set forsynthesizing polynucleotides, theprimer set comprising an oligo-m-primerand an on^^^ 
ot|de complemen^n. to the cor.ptea,en.a,y strand of the polynucleotide compnsfng the nuclide C^SS 

Ta oZZ f ^^T ""'^ ""^"""^•eowe comprises at least ,5 nuclSt 

(5) A pnmer set for synthesizing polynucleotides, the primer set comprising a combination of an oliaonudeotide 

Llr^lTJ ^"^""^ ^""^ ^" Oligonucleotide comprising a nucleotide sequence complementa^trthe 
ni^f H 1°"^"""^ ' ' '"^ """'"""■''^ ^^''^ Oligonucleotides comprL at le^TlS nu 

cleohdes and where,n sa,d combination of 5'-end nucleotide sequenceAJ'-end nucleotide s^uence is se ^.i, 
J^m the comb.nat,ons of 5'-end nucleotide sequence«--«nd nucleotide sequence set forth inTe SqId S 

(6) A polynucleotide that can be synthesized with the primer set of (4) or (5) 

(7) A polynucleotide comprising a coding region in the polynucleotide of (6) 

(8) A protein encoded by polynucleotide of (7). 

(9) A partial peptide of the protein of (8). 

(10) An antibody against the protein or peptide of any one of (2), (8), and (9) 

(11) A vector comprising the polynucleotide of (1) or (7). 

(12) A transformant canying the polynucleotide of (i) or (7). or the vector of (11) 

13) A transformant expressively carrying the polynucleotide of (1) or (7), or the vector of (11) 

l^rian™ 1^^"^ '"fr"'" °' ""^ °' ^"'^ '=°'"P^'"9 <^"""rin9 the trans- 

lormant Of (13) and recovenng the expression product 

(15) An oligonucleotide comprising the nucleotide sequence set forth in any one of the SEO ID NOs in Table 1 or 
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(16) Use of the oligonucleotide of (15) as a primer lor synihesizing a polynucleotide 

(1 7) Use of tfie oligonucleotide of (15) as a probe for detecting a gene 

(18) An antisense polynucleoOde against the polynucleotide ol (1), or the porton thereof 

(1 9) A method for synthesizing a polynucleotide, the method comprising: 

b) recovering the synthesized product. 

(20) The method of (19). wherein the cDNA library is obtainable by oligo^pping method 

(21) The method of (19). wherein the complementary strand Is obtainable by PGR 

(22) A method for detecting the polynucleotide of (i). the method comprising: 

a) inc^bating a target polynucleotide with the oligonucleotide of (1 5) under the conditions where hybridization 

b) detecting the hybridization of the target polynucleotide with the oligonucleotide of (15). 

set lonn m the SEQ ID NOs in Table 1 , or a medium on which the database is stored. 

o?p,plHllf^r.''H'^^',^^°.'''' °' '"^"'"'^ 5-- and 3--ends in the full-length cDNA of 

the present .nvenuon (173 clones), and the corresponding plasmid clones obtained in the examples de^ribed later 

the name o .hl v ""h '° °' '"^ ^--sequence are shown on the right side of 

siiuenc^ ^ • ' ' °' ^^""""^^ °" '''^ of the name of the 3 - 

[0018] Any patents, patent applications, and publications cited herein are incorporated by reference. 
BRIEF DESCRIPTION OF THE DRAWINGS 

[0019] Figure 1 shows the restriction maps of vectors pMEl 8SFL3 and pUCI 9FL3 

^ '"^ reproducibility of gene expression analysis. T7,e ordinate and the abscissa show the 
rntensihes of gene expression obtained in experiments different from each other 

[0021] Figure 3 shows the detection limit in gene expression analysis. TTie intensity of expression is shown in the 
r^. ^ 'concentration (^g^ml) of the probe used is shown in the abscissa 

Z^L T r ^ P^^'^S^^Ph showing results of analyzing temporal expression of PSEC clones in NT cells at a 

TT"f^^^ ^"'^ at 1 . 3. or 5 weeks after rebnoic acid^reatment using RT-PCR. 
10023] PGR conditions (annealing temperature and 4 kinds of cycle numbers) used are indicated under the resoectivP 
clonenamesorgene names. RA(OandRA(.)represent undifferentiated NT2UandNT2^^^ 

ofSrr«rro:r^''"'"'^^ 

Sow K "'p^'^cP" ff""*^"'' '"^ ^""^^""9 te-^Perature was the same as that used in Figure 4. Each sample was 

r^' « i """''""""^ ""'"''^^ «" '^^^ as indicated in the figure. 

Si frorplErVl?pf rll^p '^""^ '"^ '^^ amplification products 

denv^ from PSEG clones. PGR conditrons (the number of cycles) used are indicated adjacent to the respective done 

,h™? "^^'^ ^"P^^^"' undifferentiated NT2 cells and NT2 cells respectiMureS in 

the presence of retmoic acid. Each point presented on the diagram was detennined as a ratio obtainSS fo tows 

XcuCd nT''"^'"" ""^ '^^'"9 .he amount of the products n NT2 

cells cultured m the presence of retinoic acid for l week as 1 . 
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DETAILED DESCRIPTION OF THE INVENTION 

rlT ! no '"""afons ,n the number of .he polymerized nucleotides. In case the polymer contarns 

relabvely tow number Of nucleotidesjti^ 

ot de Of the present .nvention can be a natural or chemically synthesized product. Altemativet it ^n be sJEztl 
using a template DNA by an enzymatic reaction such as PGR lematw/eiy. can be synthesized 

,Sn.^"n "^'rirr'!?^'' ^ "'^ '""-'^"S'" ^^l^'^- "erein. a -full-length cDNA" is defined as a cDNA 

^^luTJn U '"^ ^""^ ^"'^ '"^ "'^O"- Accordk,g.y. the untranslmeJrSs 

m^ZZT °^ °' "'^'"S region in natural mRNA r^ay 2 ° a/ 

H^^J nn^ -^olated polynucleotide- is a polynucleotide the stmcture of which is not identical to that of any naturally 

TeeTe^arafeTen" ^^1°,!™ ^ T °' ^ "^""^"^ ~9 ^^^^ spanning m re t2 

mree separate genes. The term therefore covers, for example. 

Sh n'JIip^'L?"" "^^ '^""k"? °' ''^^ °' ^ "^""'^"y genomic DNA molecule but is not flanked by 

b^ofthecodingsequencesthatnankmatpart Of the moleculein the genome Of the organism in Which itnam^^^ 

S Zif ^"^^ f °^°^ted into a vector or into the genomic DNA of a prokatyote or eukaryote in a manner 

(?cro~Sfr:^r^^^^^^^ 

a recombinant nucleotide sequence that is part of a hybrid gene. i.e.. a gene encoding a fusion protein Soe- 
fected cells, or(.n) cell clones: e.g.. as these occur in a DNA librae, such as a cDNAor genomic DNA library 

l^!u^^l^^"'l!^ ^'f ^ ""^ '^^'^'^ '° * 9'^" polypeptide means that the protein or polypeotide 

s substantially free from other biotogical macromolecules. The substanually pure protein orpolypepti^is anSS 
e g., at least eo. 85. 95. or 99%) pure by dry weight. Purity can be measured 1^ any a^S st^^^S^ 
[ZZTI^- '^^°"'«'°9^^^y- PoVacylamide gel electrophoresis, or HPLC a^^^ ' 

tSnn^^H L^"' '""^"""k ^'^^ substantial^ pure human secretory protein or membrarie protein comprisinq 
T^T T^"^"^ T ^"^ '° 2-336 and SEQ ID NO: 342-350; the ID number is ate^^bte 

I^The 56 proteins out of 173 proteins of the present invention are encoded by the ^DNA clon^ sTown^riLn 
T^ese Clones were the clones isolated from the full-length-enriched human cDNA libraries «S , e^iio' 

=rhrhLts=.fnS^^^ 

iSmaJ //."" ''""^ "^'"^ ''^ '"""^'''S ^^-^""^ «^ °' wtf' a double- 

Clone name (PSEC number), 
^ length of cDNA, 

length of amino acid sequence, 

ATG No. from the 5' end, 

ATGprI value, 

definition of annotation data, 
^ Acx^ession No. of annotation data, 

P value, 

length of compared sequence, 
homology 

^n^" « "^^K ^^''^ '^"^ explicit homology as a result of BLAST 

analyse of GenBank (httpy/www.ncbi.nm.nih.gov/WeWGenBank/index html) and SwissPr^(htt^iS2„ i^S 
Smp"^'."^"™ "'^ 5' end means the posTon o Sil^^^Z 
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search." Nature Genet. 3:266-272) 

List 1 

[0034] 

PSEC0001//1992bp//226aa//1sl//0.94//GOLGI 4-TRANSMEMBRANE SPANNING TRANSPORTER MTP 
(KIAA0108)7/Ql5012//3.90E-53//221aa//46% i«ANt>PUHrER MTP 

st//0.94//Homo sapiens death effector domain-containing testicular molecule mRNA 
complete cdsy/AF043733/y3. 1 0E-37//852bp//62% "lo'^cuie mMNA. 

sapiens CLDN6 gene for claudin-67/AJ249735//5.00E-28V/ 
1^0^97^^'^^^ FK506-binding protein (FKBP63) mRNA. partial cdsV/ 

lltZ'?^^^^^^ ^^^-^^"^ ^^^-^^ CHROMOSOME ,y/ 

PSECOO12//1499bp//183aa//lSt//0 82 

PSEC6021//1 851 bp/M 1 6aa//3rd//0 82 

(v1pSSS?$^SS^^ .NTEGRAL.MEMBRA.E PROTEIN V.P36 PRECURSOR 

li^^j^S^tl^SS^r'^''''''''''''' ^^'^ ^ .-.-.-)./P37^/1.00E-aV/ai7aa//3^. 
PSEC0031//1336bp//136aa//2ncV/0.2 

ol^^!Slf^''^^'^^''°-^^'^^""°NAL OLFACTOMEDIN-RELATED ER LOCALIZED PROTEIN 
PRECURSOR (NOEL) (1B426B)7/Q62609V6.30E-33/'/373aaV28% ^AU^bU PROTEIN 

Iff^^^'^^f^^'^^^O-^'^iOSE PHOSPHATE/PHOSPHATE TRANSLOCATOR NON-GREEN 
PLASTID PRECURSOR (CTPT)y/P52178//6.60E-13//157aa//33% >mun bHbfcN 

PSEC004(y/2027bp/y2l 6aa//2ncV/0.82 
PSEC0041//2518bp//240aa//2nd//0.51 
PSEC0045//1 631 bp//372aa//1 St//0.e5 

SS^"'^'^'^*^^'^"'*'"''^'''""'™' "^^'^"^ "'"'^^'^ CCls7/AF015287//(y/ 

1^^^^^"^^^"^'°-^^^°'^° saPfens brain my047 protein mRNA, complete cdsy/AF063605//(y/ 
PSEC0051//3293bp«227aa//3rd//0.63 

PSEC0052//3635bp^/578aa//2nd«).94//AQUALYSIN I PRECURSOR (EC 3-4^1. -)7/P08594//1.60E^/348aa// 

'^''"^ --"^CURSOR (FlBROCHIMERIN)y/ 

^^l^^'^^'^^'^^^'2'*'i"0-32//l)DP N-ACFTYLGLUCOSAMINE TRANSPORTER (GOLGI UDP-GLC- 
NAC TRANSPORTER)y/Q00974//4.80E-42//314aa//31% UUK-t.LC 

"""^"'"^ '^^"^""^ """P"®*^ Cds7/AF072128//4.50E-127// 

iTaS^'''''''^^^'''''''^""^'^^''"'^''^^^'™^^''^"^^ 2.4.1.-)y/P16661//6.00E-42// 
PSEC0068//1 717bp//1 94aa//1st//0.64 

pSSSla"""^'"^ TRANSFERASE STr3 SUBUNIT HOMOLOGV/ 

PSEC0071//3558bpV875aa//1St//0.94//INTER-ALPHA-TRYPSIN INHIBITOR HEAVY CHAIN H3 PRErilR<;nn 
f4';/^7«" HYALUR0NAN-ASSOC.ATED PROTEInS^^SS 

PSEC0072//2092bp//350aa//1st//0.94//HomosaplensmRNAfor putative vacuoter proton ATPasemembrane^^ 
lorassociated protein M8-9y/Y17975//2.10E-133//622bp//99% enoranesec 
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PSEC0073//2341bp//523aa//1st//0.94//UDP<5LUCURONOSYLTRANSFERASE2Cl MICROSOMAL (EC 2 4 1 
17)(UDPGT)(FRAGMENT)7/P365l4//7.90E-71//477aa//36% ' ' 

PSEC0074//297lbp//770aa//1st//0.89//Mus musculus mRNA for semaphorin w. complete cdsy/AB02129l//Q// 
2579bp//85% 

PSEC0075//2244bp//633aa//2ncV/0.79 

PSEC0076//3253bp//860aa//1st//0.94//MITOCHONDRIAL PRECURSOR PROTEINS IMPORT RECEPTOR (72 
KD MITOCHONDRIAL OUTER MEMBRANE PROTEIN) (MITOCHONDRIAL IMPORT RECEPTOR FOR TVIE 
ADP/ATP CARRIER) (TRANSLOCASE OF OUTER MEMBRANE TOM70)y/P23231//3 80E-11//194aa/y28% 
oSoS//^^^^^^^ CARDIAC MUSCLE ISOFORMS (TNTC)7/P02642// 

PSEC0079//1 290bp//1 89aa//2ncU/0.94 

PSEC008Q//3171bp//740aa//2nd//0-94//Homo sapiens mRNA for NAALADase !l proieiny/AJ012370//(V/3l31bp// 
PSEC0081//2890bp^/1 72aa//1 st//0.94 

PSEC0082//1878bp//331aa//1st//0-94//PROBABLE OXIDOREDUCTASE (EC 1.---.-)y/Q03326//7.30E-3Q// 
269aa//34% 

PSEC0085//2392bp//280aa//1st//0.85//PROBABLE PROTEIN DISULFIDE ISOMERASE P5 PRECURSOR fEC 
5.3.4.1)7/P3866Q//5.60E-10//105aa//39% c>un itu 

PSEC0086/yi821b(V/390aa//1st//0.83//CELL SURFACE A33 ANTIGEN PRECURSORy/Q99795//2.30E-2ay 

259aa//32% 

PSEC0087//1808bp//441aa//1st//0.94//Homo sapiens G prolein^upled receptor mRNA. complete cdsV/ 

AF181862//5.40E-27//1114bpV60% . "'"M-e'e cos.^/ 

PSEC0088//2015bp//467aa//1st//0.9-l//CATHEPSIN B PRECURSOR (EC 3.4.22.1)//P07688//1.10E-39//315aa// 

34% 

PSEC0090//l722bp//543aa//lst//0.92//Homo sapiens heparanase (HPA) mRNA, complete cdsy/AF144325//a'/ 

1 722bp//99% 

PSEC0094//2291bp//564aa//1st//0.93//PROTEIN PTM1 PRECURSORy/P32857//7.10E-15//284aa//28% 
PSEC0095//2080bp^/349aa//1 SI//0.94 
PSEC009a/y21 85bp/y208aa//1 st//0.94 
PSEC0099//1 627bp//350aa//2nd//0.91 

PSEC010Q//1391bp^/172aa//1s(//0.77//Homo sapiens clone 24952 mRNA sequence, complete Cdsy/AF131758// 

7.70c-308//1 391 bp//99% 

PSEC01 01//2547bp//258aa//2nd//0.92 

PSEC01 04//1 430bp//41 8aa//2nd//0.79 

PSEC01 0y/2506bp//494aa//1 SI//0.94 

PSEC01 06//2465bp/y326aa//2nd//0.94 

PSEC01 07//2557bp//1 30aa//2nd//0.89 

qSSg'S^??^^^ ^^'^ PROTEIN C30D1 1 .06C IN CHROMOSOME 17/ 

norveglcus leprecan (leprel) mRNA, complete cds7/AF087433//CW 

2501 bp//84% 

PSEC011 0//21 79bp//344aa//1 st//0.94 
PSEC01 1 1//3362bp^/208aa//1 SI//0.83 
PSEC01 1 2//3598bp//S49aa//4th//0.74 

PSEC0113^/2451bp^/423aa//1st//0.79//36 KD NUCLEOLAR PROTEIN HNP36 (DELAY ED-EARLY RESPONSE 
PROTEIN 12) (DERl2)y/Q61672//4.20E^22//169aa//34% Mti>KUNbt 
PSEC0119//2518bp//555aa//1st//0.87//HYPOTHEriCAL 63.9 KD PROTEIN C1F12.09 IN CHROMOSOME 17/ 
01 0351//4.50E-26//240aa//30% 

PSECQ12CV/2250bp//302aa//2ncV/0.94//Human alpha-1 .3-mannosyl-glycoprotein beta-1 . 2-N-acetylglucosamlnyl- 
transferase (MGAT) gene, complete cdsy/M61829//0//2235bp//92% 

^f^^^^^]^^^^^^^^^ PROTEIN T15H9.1 IN CHROMOSOME II PRE- 

CURSOR7/Ql0005//4.10E-106//351aa//58% ^owmh rMt 

PSEC0124//1685bp//476aa//1sl//0.91//VITELLOGENIC CARBOXYPEPTIDASE PRECURSOR (EC 3 4 16-W/ 
P42660//1.10E-103//444aa//45% ' 

Z^n^oofJ^^^ ^^P'^"^ ^'^^^ " membrane protein, complete cds. clone: 

HPl03287/AB015630//4.50E-306//1433bp//98% 

PSEC0126//1906bp//l02aa//1st//0.89//Homo sapiens mRNA for leukotriene B4 omega-hydroxylase. complete 
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cds7/AB002454//3.90E-251//970bp//86% 

PS EC01 27 in TlZbplfl 1 8aa//1 st//0.94 
PSEC0128//2134t)p//306aa//1st//0.94 
PS EC01 29//1 828bp//1 35aa//1 st//0S4 
PSEC01 3(V/2934bp//2e5aa//1 st//0.68 
PSEC01 31//1 658bp//297aa//1 st//0.94 
PSEC01 33//2023bp//240aa//1 SI//0.94 
PS ECOl 34//1 898bp//1 44aa//6th//0. 7 1 

1 mRNA, complete Cdsy/AF1 1 81 Oa//0//1 640bp//99% «^ep»or LT vt 

PSEC01 36//1 907bp//392aa//1 st//0.93 
PSEC01 37//2981 bp//571 aa//1 st//0.94 
PS ECOl 39//1 361 bp/y^1 8aa//2nci/'/0.89 

PSEC0143//1976bp//125aa//1st//0.74//ENDOSOMAL P24A PROTEIN PRECURSOR (70 KD ENDOMFMRRAMp 

S^/S^r'^"''''''''^'''''""'''"''"''' Cdsy/AF151835//a// 
aToE^SS^™" ''''^^^''''' ^ PROTEIN-COUPLED RECEPTOR GPR17 (R12)y/Q,33(M// 

plISSE^SSiSr •"''"^''^^ '^^^^ - CHROMOSOME IW/ 

PSEC0149//1 988bp//432aa//1st//0 94 

:™Soir^^^^^^ ""'^"^ ^-^^ "^'^^ <^-^> -P.e,e c^y/ 

PSEC0151//1688biW467aa//1st//0.93/mSSUE ALPHA-L-FUCOSIDASE PRECURSOR fEC 32 1 "Ji^ fA, pha 
L^RJCOSIDASE I) (ALPHA-L-FUCOSIDE FUCOHYDROLASE)y/P04066//5 ^ ^ 

PSEC0l52//2130bp//374aa//2ndW).86 "'^^c .w/4{>9aa//55A 

s;o^?^)TssrrE.^^^^^^^^^^^ ^- — - 

PSEC0162//1320bp/y271aa//1st//0 83 

sisssfExrSsT'^™"^"^ ''"^"^^'^ "^™^-'c 

^?a;Sl-^SSS''° '''''''°"" PA^GENESIS-REtyVTED PROTE.N (RTVP-1 PROTE.N)y/ 

PSEC01 65//211 1 bp//242aa//1 St//0.e3 
PSEC01 67//874bp//1 03aa//7th//0-73 

''^'[^^^'^"-''^-^''Mor.o sapiens E2IG4 (E2IG4) mRNA. complete Cdsy/AF191019./(W 
PSEC0171//2005bp/y301aa//2nd//0.91 

'!^^2^lHr'^f,'"Tf''^'"''' ^'^"^ P^°'«'"-«« ^"''^noer protein 2 

(PC0LCE2) mRNA. complete Cdsy/AF098269//cy/1741bp//99% 

ooc^°l?^'^^^^ OLFACTOMEDIN-RELATED ER LOCALIZED PROTEIN 

PRECURSOR (NOEL) (1B426B)7/Q62609//6.60E-33//373aa//28% LUUALIZbD PROTEIN 

PSEC01 78//2308bp//222aa//3rcK/0.94 
PSECOl 81//1 890bp//1 65aa//3rcl//0.66 

PSEC0182//21 W/657aa//2nd//0.82//Homo sapiens mRNA for UDP-GalNAc:poVpeptide N-acetylgalactosam- 
inyltransferase 7y/AJ002744//0//2006bp//99% dcetyigaiactosam- 
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PSEC0183//2031bp//451aa//1sl//0.88//CART!L^GE MATRIX PROTEIN PRECURSOR (MATRILIN-1)y/P05099// 
5.50E-63//228aa//54% 
PSECOl 90//1 841 bp//1 94aa//1 SI//0.87 

PSEC0191//1493bp//472aa//1st//0.87//EU\STIN PRECURSOR (TROPOELASTIN)y/Pl5502//5 OOE-113/'/ 
367aa//67% 

PSECOl 92//1 557bp//1 53aa//1 st//0.93 

PSEC0197//3555bp//576aa//2nd//0.85//PEROXISOMAL-COENZYME A SYNTHETASE (EC 6 - - -)7/P38137// 
1.30E-33//169aa//32% \ ^ , . ;7/rooJ0/// 

PSECOl 98//2083bp//343aa//1 St//0.94 
PSECOl 99//2586bp/^83aa//1 st//0.94 

PSEC020Q//1548bp//443aa//1st//0.94//Mus musculus immunosuperfamily protein B12 mRNA cx)mDlete cds7/ 
AF061260//4.30E-243//1297bp//89% ' ^•'^ 

PSEC0203//1 457bp//323aa//1 st//0.87 
PSEC0204//1 484bp//1 42aa//1 St//0.74 

PSEC0205^/1656bfV/435aa//lst//0.94//CELL DIVISION CONTROL PROTEIN 91 y/P4 1733^/7. 70 E-41//290aa// 
33% 

TmI^m^^ ^ '^'^"^ multtepanning nuclear envelope membrane protein nurim 

(NRM29) mRNA. partial cds7/AF143676//0.00E+0Q//1399bp//99% 

PSEC0209//2144bp//186aa//1st//0.93//Homo sapiens Pancreas-specific TSA305 mRNA complete cdsV/ 
AB020335/yQ//1770bp//99% ^ 

PSEC021 0//1 689bp//349aa//1 st//0.7 1 
PSEC021 3//1 824bp/y323aa//1 st//0.94 
PSEC021 4//1 959bp//1 41 aa//1 st//0.94 

AFoT^Sm^ObSe^^^^ ^^^'^"^ precursor. mRNA, complete cds and 3' UTRV/ 

PSEC021 6//1 765bp//41 0aa//2nd//0.89 

L^i^^^f ^^^^^^"^^^^^^^^^^^'^^^^^"^^ ^P'^"^ (^^1) "i™A, complete cdsy/AF007871//3 10E- 

26//619bp//61% 

PSEC022(y/1584bp/y365aa//1st//0-94//Mouse Wnt-6 mRNA. complete cdsy/M8980{y/5.50E-19a//1310bD//82% 
PSEC0222//899bp//139aa//2ntV/0.94 "i^P'/o.^vo 
PSEC0223//1874bp//221aa//1st//0.94 

PSEC0224//1463bp//170aa//1sl//0.89//UROMODULIN PRECURSOR (TAMM-HORSFALL URINARY GLYCO- 
PROTEIN) (THP)y/P48733//8.30E-1 a//1 41 aa//36% 

^ PSEC(^6//2l03bp^/477aa//1st//0.94//Mus musculus carboxypeptidase X2 mRNA. complete cdsy/AF017639// 

PSEC0227//1410bp//379aa//2ncV/0.8l//Cricetulus griseus SREBP cleavage activating protein (SCAP) mRNA 
complete CdsyAJ6706(y/2.50E-231//1O99bp//84% 

r^!A?!^Mo'*^?^'^^^'^^'''°-^^''^°''-^°^^° VESICLE MEMBRANE PROTEIN P24 PRECURSOR 
40 (P24A)(RNP21.4)y/Q63524//5.90E-21//110aa//32% v^unouH 

PSEC0230//1784bp//271aa//1st//0.76//SIGNAL RECOGNITION PARTICLE RECEPTOR BETA SUBUNIT (SR- 
BETA)y/P4775aV5.80E-123//271aa//90% 

PSEC02^/1709bp//246aa//1st//0.75//30 KD ADIPOCYTE COMPLEMENT-RELATED PROTEIN PRECURSOR 
(ACRP30) (ADIPOCrTE SPECIFIC PROTEIN ADIPOQ)y/Q60994//3.30E-24//242aa//32% 
« PSEC0233//2499bp//267aa//1sl//0.82 
PSEC0235//1 601 bp//211 aa//1 St//0.94 

P^f^^SOOE'SSSleS^^'^'^^^^^^ GAMMA-1 CHAIN PRECURSOR (Ly\MININ 82 CHAIN)y/ 
^ PSEC0240//1638bp//253aa//1st//0.94//WNT-11 PROTEIN PRECURSORy/096014//3.40E-109//220aa//93% 

PSEC0243//2835bp//743aa//3rd//0.77 
PSEC0244//2063bp/^87aa//1sl//0.91 

PSEC0245//2896bp//418aa//3rd//0.91//INTEGRAL MEMBRANE GLYCOPROTEIN GP210 PRECURSORy/ 
P1 1 654//3.40E-205//483aa//78% 

PSEC0246//2969bp//345aa//1st//0.94//LOW-DENSITY LIPOPROTEIN RECEPTOR- RELATED PROTEIN 2 PRE- 
CURSOR (MEGALIN) (GLYCOPROTEIN 330)y/P98158//1.60E-22//126aa//42% 
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rw5.^^''''^^^^'^'^^^''^''^°-5^"'LATELET-ENDOTHEUAL TETRASPAN ANTIGEN 3 (PETA-3) JGP27^ 
(MEMBRANE GLYCOPROTEIN SFA-1) (CD151 ANTIGEN)7/035566//3.30E-28//237aS% ' ^ 

PSEC024a//2694t)p//172aa//1st//0.84 -^"'^o/ddz/^a /, 

GASTBULATION PROTHIN PRECURSORy/P54356^/1.50E- 

sSaSE^jssr''^™"""'^ ' '"^^^^"^ ^'^^^-^^''^ "^^^"^^-^'^ 

rj^^ o^^owi'^'^^^'^^'''°-^^^''"°S''"'^""°YLIN0SIT0L-4-PH0SPHATE S^KINASE TYPE II ALPHA 
(EC 2.7.1.68) (PIP5KII-ALPHA) (1-PH0SPHATIDYLIN0SIT0L-4-PH0SPHATE KINASETrPTDINS 4iP 
NASE B ISOFORM) (DIPHOSPHOINOSITIDE KINASE)7/070172//,.30E-139//2i)aaJ^S? (PTDINS(4)p.5-KI- 

SeSskss;^^^ 

2^3^^^^'''^^''^^^'^"'^'°"°"''^'^°^""°"^ (CYTOCHROME B-561)y/Q95245//3.70E-44// 

PSEC0260//2492bp//496aa//1 SI//0.94 

PSEC0261//3080bp//B06aa//2ncy/0.76'/MITOCHONDRIAL PRECURSOR PROTEINS IMPORT RECEPTOR m 
KD MITOCHONDRIAL OUTER MEMBRANE PROTEIN) (MITOCHONDRIAL IMPORrTOTOR fJ^™^^ 

P&EC0084//2788bp//335aa//1st//0.86^/IMPLANTATION-ASSOCIATED PROTEINy/035777//1.80E.167//335aa// 
PSrc0237//1419bp^/248aa//1st//0.81//Homo sapiens CTGIa mRNA, complete cds7/U80744//8.30E-22//556bp// 

PS EC0264//261 7bp//1 57aa//1 SI//0.94 
PSEC0265//2646bp//192aa//1sl//().76 

L'^c^If«L°." 11^'°"^ '^'^"^^'y 'o*s«>fe in the ATGprI (PSEC0017. ATGprI 0.33; PSEC0030 ATGorl 0 26- 
^C0M1 ATGpr, 0.20: PSEC0049. ATGprI 0.35): These clones, in which dS of me S'-end wS^neS 

?onS hL?,htl„ ^ " ""'^ '^"""^ "^^'"^ '=ONA Clones was analyzedllui the 

Clones have the signal sequence m the N-tenminus. In addition, the above 4 clones except PSEC0049 had Dortio^ 

EST^ntn pI T f,^^'"^^'-"'^ to known EST PSEC0049 h1^ port^s^ot c^S h 

wS:u«"re^^^^ 

sS in Ust T^PsS?Xrl^V 1^ P^"^"' "^'^ '=°NA clones as 

snown in List 2 (PSEC0027. PSEC0047. PSEC0066. nnnnnnnn. PSEC0069 PSEC0078 PSEC00q2 P'iFnnKvi 
dS"'- ^Si^°'^^':S^C02,2. PSEC0239, PSEC0242. PSEC0251, and PSE^.^STne^ wSpS 
R Ld^nT f M^Td'T' P^^'-'" («'"'a"""9 '"e transmembrane helix) by the MEMSAT (Jones D.T T^^r w 

Tus the Inp?^ ^^'^r ^ Bioinformat«s 14: 378-379) (Mitsuf lnforn«,ion D^Sn, i^) 

h , „7'^ r^*"^ """'"^ ""^ '^^^'^ "'''^"^ constructed by the olig<«apping methS K 

and SOSIJI The proteins encoded by the clones are also classified into the category of a secretory proteins wto^h 
brane proteins described above. Two clones among the 15 clones (PSEC0242^d PsIS^K JSS^ 

p2n?c k ^ "^^'^ '^"^^P^)' '"^ ^^""'"9 P'°'ein will contain a signal sequence in the N- 

ten^mus. Accordingly. ,t « possible that the two clones are classified into the category of secretoiy proteins or mem- 
brane proteins that contains a signal sequence In N-terminus w-eoor proteins or mem- 

.Klnrt^frr "^'"^ ''^^^ °' '^"S"^ °' a-"'"" sequence. ATG No from 

MEMsTt a^^SOSuriS^^^^ ''""^"'^^ "^^^''^'^'^ ^-"^ '0^ --brane pro^ei by - 

cl„ f a"'' SOSU . definition of annotation data. Accession No. of annotation data, P value length of comoared 
sequence, and homology in.this order, separating each of these with a double-slash mark // 

G.IZT1^Z T^^T f ""P'^' ^ « -^^"l' o' BLAST analysis of 

stS^r^JSSZetlr '"^^^^^^ (h«P«i.ac.u./ebi!doc^ 
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Ust2 
[0037] 



PSEC0027//1 085bp/y27l aa//1 st//0.94//No//transmembrane 

r.!!i^rf ^^^^^^^ MEMBRANE PROTEIN 2B fTRAN^ 

MEMBRANE PROTEIN E3-16)y/042204//1.80E-55//264aa//44% ^^HUIEIN 2B (TRANS- 

nnnnnnnn//2105bp//730aa//1st//0.26'/No/Aransmembrane//VERY-LONG-CHAIN ACYL^JOA SYNTHFTACip ,pr 
6^.1.) (VERY-LONG-CHAIN- FATTY-ACID-COA L.GASE)y/035488//2 SM^SaSiy ' 
PSEC0069//2568bp//433aa//2n(WD.94//NQ//iransmembrane i40ff520aa//45/, 

lSin\7NS^^'''^lT''''T^'"''''^^^^^ ^^'■^"^ neuroendocrine-specific protein-like 

protein 1 (NSPL1) mRNA, complete cdsy/AFl19297//0//2524bp//99% 

rp!,t?hl!rCl'^''f''^K?*''^''''^°'^^^^^^ lipotysis-stimufated remnant re- 

ceptor beta subunil mRNA. complete cdsy/AF119669//2.00E-221//1048btV/76% 

- SIS^SSSLT'''"""^^"''''"^^^ ^ PROTEIN-COUPLEO RECEPTOR 

pTeS'S^r^rbStr "''^^^^^^ '^^^ corn- 

SyM?^;;:r7f^^^^^^^ -P^'^- ^--e mRNA. complete 



10 



15 



25 



30 



45 



50 



55 



PSEC0242//3017bp//401aa//lst//0.9//No//transmembrane 
PSEC0251//2372bp//393aa//1st//0.78//No//lransmembrane 



?H^.'f^^f?^°'^^^'^^^'^^'°-^^''^°'^^"®"'^'"''^«"^^0'n«> sapiens protocadherin alpha 12 rPCDH 
alpha12) mRNA. complete cdsy/AF152308//Q//3520bpV99% Potocaonenn alpha 12 (PCDH- 

PSEC0078//2194bp/^a//2ncl'A).24//NQ/Aransmembrane//M-Sema F=a factor in neural network d«oinnm<.„r 
[mice, neonatal brain. mRNA. 3503 nt]y;S79463//1.50E-282//l945bp//83% development 



(Annotation 1) 



[00381 Clones with relatively low score in the ATGpri (PSEC0239 ATGon o mv pciprno-io ... 

w,m known ESTs revealed that the ctone has a ponion not contained in ESTs in the 5'-end of the cDNA. 
(Annotation 2) 

[0040] Herein, "SP-Yes" means thata signal sequence is present at the N-terminus. predicted by the PSORT. 

(Annotation 3) 

S ^ ? "'"""''"^'^ "^"^ ^ "^'"9 ESTs as criteria lor the judgment °' 

^(P^CollsTd'SSoeTAl^r::^ 

, H . K PSEC0206). As a result of the homology search of the SwissProt PSEC01 95 and P«;PCo?nfi 

werefoundtohaverela..ely high homology With mouseplasmamembraneadap.erHA2/AfladS^^^ 
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TJ^T^ai^n^^T'^ H precursor (prohormone processing carboxypeptidase) in .he secreton. granule, re- 
spectively. Accordrngly, the protems are classified into the category of secretory proteins or membrane proteins. 



Lists 



th^ 5 Pnd frrni ? ""'"^'"'"'^^'^ f'SEC number, length of cDNA. length of amino acid sequence. ATG No. from 

MEMSAT kS^nc^nf ".'-.''r "^O"^' P^^**^'^'^ for membrane protein by 

MEMSATand SOSU . definition of annotation data, Accession No. of annotation data, P value, length of compared 
sequence, and homology in this order, separating each of these with a double-slash mark, //. compared 

PSEC0195//1979bpV467aa//2nd//0.80//No//No//ALPHA-ADAPTIN C (CLAWRIN ASSEMBLY PROTFIN row 
HA2/AP2 ADAPTIN ALPHA C SUBUNIT)y/P17427//1.8E-144//281aa/y98% MtMbHANE ADAPTOR 

^fS2^J"^^^P'^'^^'^''^'0.9a/M/No//CAT^BOy<yPEPTlDASE H PRECURSOR (EC 3 4 17 10) (CPH> 

an appropnate expression system, or by using a spedfic antibody against it Dinant protein utilizing 

f!J mi,hi^°c '^i?' ^^^""^ °' ^ P^"*^'" °' membrane protein can be analyzed according to 

he methods descnbed in "Glycobiology- (Fukuda M.. and Kobata A. edit., (1993)). "Growth Factora" (McKay I and 

Approach (IRL PRESS), or Glycoprotein Analysis in Biomedicine" (Hounsell E.F. edit.. (1993)) in the series of 'Method 

ments Vo^.7 Growtt, and differentiation factors and their receptors" (Japan Biochemistry Society edit (1991)) fToLo 
m^^^TZ .'^f.^- Ne"™'^«"smit,er Transporters". "Vol.294: Ion Channels (Part C)". "^01.293 on ChaS 

M9-^olT' T '^"'-^^'^ BiomembranesVVoL19,: Biomembran^^!^™ 

the bS^^niL. IkT", "^""^ "^"""^ °' Enzymology (Academic Press) may be used 0 analyze 

n the search of the Online Mendelian Inhentance in Man (OMIM) (httpy/www.ncb..nlm.nih.aov/Omim/ > using the fol 
lowing keywor*. the results obtained with each keyword, suggest t he association of the proteins with m anJSe^^ 

^dS P™'"'"^ ' '^'9^' ^« 

[00461 New information is constantly updated in the OMIM database. TTierefore. it is possible for one skilled in the 

^findanewrelationshipbetweenaparticulardiseaseandageneofthepres^ 

[00471 Keywords used in the search of the OMIM i^jmeu udwoase. 

(1) secretion protein 

(2) membrane protein 

f^™? H^^t'^J" '"^ ^^^"^ '^"^ """y '"^ action numbers in the OMIM. Using the number data showina 
the relationship between a disease and a gene or protein can be seen. TTie OMIM data has been renewal eve^I' 

1) Secretion protein 

268 entries found, searching for "secretion protein" 

1^675 JS' \ZT- 603215. 157147, 600174. 
1^^^" ™ ^ ^' 179509.246700. 179510, 600626. 179511.600998. 109270 601489 

I^f • o^f' l^'^^go. 602672. 146770. 603062 179508 13123o'60 SgJ' 

602421. 139250. 167805, 167770. 600041 . 600564. 118825. 601146. 300090 600753 601652 6007^' 

3o^' JL'S? ^B^J So! 'S' 

308700. 134797, 137350, 227500. 176300. 107730. 600760. 138079, 120180, 120160 120150 124092 138160 

o,tn 2^ 120140. 162151. 158070, 152790 120120 1061 Oo' 300200 

192340. 190160, 138040. 147470. 147620. 173350. 147380. 152200. 152760. 157145 153450 26408o' nS" 
600937.600840. 188545.202110.600514. 186590.603372. 136435. 137241 252800. 2S 20^50 1^85^ 
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139191 . 142640, 138130. 189907. 603692. 600633. 603355. 107270, 600377. 147892. 232200 600281 232B00 
602358. 137035, 601771. 601769. 253200. 601933. 118444, 600270. 120700, 600945. 603732 147660*600761* 
172400. 600823. 600877. 130080. 171060. 107740. 307800. 602843. 130660. 152780. 124020 6OII24' 601340* 
601 604. 601 61 0. 1 71 050. 31 2060. 232700, 3001 59. 1 42703. 600734. 1 25255. 1 68450 1 2381 2 ' 1 8854o' 1 4794o' 
188450, 600839. 182452. 188400. 182260, 176760. 263200. 600264, 188826. 252650. 601 185* 162641 * 137216* 
601398. 601538, 118888. 118445. 601745. 190180, 601922. 182098. 602008. 147440, 602384* 600031 ' 1 09160* 
602653. 151570.602682.602730.602779. 146880,603061. 142704.603140. 106150.600732 153620*603318* 
1 39392. 600042. 1 02200, 603493. 1 821 00, 264300. 603795, 1 84600 

2) Membrane protein 

1017 entries found, searching for 'membrane protein" 

130500.305360, 153330, 173610, 1 70995, 109270, 170993,309060, 120920,602333, 133740, 133710 602690 
133730. 159430. 600897, 133090. 601178, 602413. 602003. 109280. 603237. 602173 107776* 602334* 125305* 
602335. 182879. 154045, 309845. 600594. 603718. 603241. 603214. 603657. 603177. 600182* 601476* 602879* 
136950, 600723, 601114, 185880, 185881,300096, 602257. 160900, 177070. 603062, 603344 602977*310200* 
600959, 300100. 186945.600039. 600267. 128240. 182900. 601097, 136430. 600946 602534* 601047* 143450* 
6031 41 . 603700. 600579. 256540. 1 59440. 60241 4, 600403. 602048. 1 88860. 1 37290, 1 58343* 1 84756* 60291 0* 
603179. 600279. 108733. 107770. 1 73335. 602625. 154050, 219800. 603850. 601028. 600447 104225* 186946* 
601 767, 6031 43. 121 01 5, 60321 5, 227400, 603735. 6001 79, 602421 . 1 80721 , » . . 

176801. 176860.600753.603142. 176790, 600265. 601239. 115501, 143890, 121014, 121011 125950 603534 
304040, 601134. 600754. 601510. 601595, 190315, 300172, 602216. 602261, 602262, 602461* 131560* 179514* 
179512, 176981. 142461 . 139310. 312080, 176640. 128239. 185470. 310300. 601403, 601757 273800* 151460* 
1 76943, 1 0431 1 . 1 68468. 1 201 30. 602887. 6001 64, 601 531 . 601 832. 1 04775, 600040. 603583 1 76894* 602631 ' 
166945. 182180. 120620, 141180. 601014. 139150. 182860, 177061.600174, 180069. 191275 104760*601693 
300017. 603518, 601009, 134651. 601107, 603868. 600168. 136425, 603531, 603291, 600917, 603216* 102720* 
300118. 179590. 135630. 602285. 107150. 602296. 303630. 176878. 120090. 600322 138160 601212* 603293* 
131230. 112205. 600763. 600718, 300187. 170715. 601966, 300051. 602474. 

1 20070. 600691 . 600855, 1 82309. 6021 01 . 602857. 1 94355. 1 62230. 600874. 'l 1 3730. 1 55550. 602701 306400 
501789,231200, 107271. 175100. 182870.305100,301000. 601313, 157147. 147670. 139200 603593* 157655* 
600934. 155970. 602049. 155960. 155760. 118990. 135620. 308230. 602694, 162060. 300023 160993* 153619 
153432, 120131, 603823, 603167, 601023,600816. 165040. 601681 , 166490. 300112, 120 190.* 300 145* 163970* 
600772. 602926, 602933. 602202. 400015. 151510. 600759. 602672. 602654, 603821. 116952 151430* 602632* 
155975. 602217. 150370. 600752. 601119, 600932, 603048, 603234. 601805. 603822, 603869 601717* 601181 ' 
313440, 139130, 107777. 109190. 603452. 191163. 191164. 602370. 176877. 103195.600523. 191328*601275 
204200, 602426. 60381 0, 600551 . 600695, 600552. 600553. 602306, 601 523, 

602507. 602299. 600583. 114070. 600632. 603498. 185430. 600587. 235200. 173470. 603199 601633 602500 
208900. 180297. 156225, 516020. 190195. 141900. 102680, 193300, 101000, 193400. 300011 107400*257220* 
107741 , 180380. 203200. 111700. 600024. 304800. 600065. 110750. 179605. 113705. 601638*222900* 120120* 
602509, 602469, 600930, 601 383, 1 76261 , 602574. 602997. 31 1 770, 1 31 550. 60361 6. 308700 603372* 2561 Oo' 
224100. 276903. 305900. 516000. 131195, 314555. 601567. 603866. 306900, 103390. 186720. 173850* 601050* 
602505. 186590.246530.602689. 194380,300041. 162643. 152790. 120150.600682.600106 272750* 188040* 
602382. 601497. 113811. 182138. 212138. 601309. 109690. 114760, 176805. 601253. 123900,' 602581 ' 189980* 
191190, 110700. 600163, 137167. 600580. 601610, 190000, 123825. 603491, 

600135. 186591. 173910. 138140. 107266. 120950, 601081 . 603690.'244400. '312700. 171060 601199 601758 
170500. 277900. 601997. 314850. 601880, 603009, 120220, 603126. 164920. 602934. 1 64730 163890* 603434* 
107269. 602909. 600877. 256550. 164761, 602872. 120110, 126150. 158010. 266200. 223360 250800* 269920* 
252650. 603355. 154582. 138190. 300035, 602640. 227650. 158120. 153700. 182380, 155740 204500 603401* 
601975.300135. 136350.602924.300167, 185050. 176100.300189. 151525.300200. 165180 230800*502158 
602676. 603411, 193245, 120325. 601848, 192500, 603102. 147795. 245900. 137060, 147557. 120650*602317* 
307800. 120930. 308100, 142800. 191092. 232300. 173510. 602225. 180470. 190930. 186357 134638* 600544* 
601373. 600509. 600359. 603784. 600395. 600653. 603754, 601 597, 601 066 

600185. 501295. 600978. 205400. 603274. 600418. 600839. 516050. 601691 . 601007. 600650 600308 603261 
601193.600004.600017.516040,253800,276901.600019,257200, 108780,300037 300104 300126*255125* 
203300, 300191. 426000. 302060. 304700. 201475. 252010. 193210. 311030, 306250.248600* 191740* 108360* 
131244.600423.232200.191305,231680, 103320. 190180,600493. 111200,226200,312600 600170 111680* 
186910.203100,600536.600238, 186830, 186760,186745.186711, 106180.112203 103180*182530*182160* 
600644. 307030, 192321, 600667. 125647. 179080, 114207. 114860. 176000, 116930. 500748 173515* 173325* 
600377. 171760, 171050. 118425. 170260, 191315.600798.600821.600823.600444.600840 159465*600857* 
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158380, 600867, 154360. 152427. 150330. 110900. 147840, 147360, 147280. 

146880. 312610. 120940. 142871, 142790. 600937. 142600. 134390.1 11 250. 600979. 600997. 142460. 186845 
134635. 601017. 139191. 139090. 138850.601040. 138720. 122561, 131100. 123610.217070.' 100500 603377' 
602354. 603302. 603207. 603086. 602188. 602095. 603867. 603842. 603798. 602602. 601 194, 602607. 603713 
603681 . 601252. 603648. 603645. 603644. 601282. 601284. 603667, 603712. 603594. 601872! 603425* 601843' 
603263. 603208. 601411, 603201 . 603189, 501463, 603164. 603152. 603087. 602874, 601492,' 602893* 602057* 
602859. 602746, 603879. 603510. 602458, 603380. 601581. 603765. 603283. 601599. 601733,' 601852! 602316* 
601615. 601 617. 602184. 602894. 603005. 603030, 603861. 602835. 602136, 600153. 600074] 600046* 600023* 
601625, 516006. 600018, 600016. 516002, 601590, 313475, 313470. 600244. 

600528. 601611. 600282. 600327. 601568. 600368. 601130. 601535. 601745. 601929. 300169. 300150 300132 
601533. 600385, 600464. 600421, 600429. 601156. 601488. 516005. 251100. 516004. 600918. 516003* 602192* 
516001 , 240500, 600465. 602241 , 602243. 230200. 601485. 601478, 601416, 602297. 601459! 601839* 602314* 
193065, 193001, 191306. 600504. 601020. 191191 . 602372. 190181 . 600534, 188380, 186854! 186360* 60053o' 
185250. 182331 . 600535. 182305. 601296. 600582. 600732. 600734. 600742. 600782. 176802! 176266* 600769* 
601 883. 600864. 601 901 , 1 76260, 1 73490. 60091 0. 601 905, 1 71 890. 60091 6. 601 987, 602679! 1 62651 * 1 61 555* 
1 60994. 602714. 602715. 602724, 602736. 300007. 602783. 275630, 602836. 270200. 602871 ' 159460* 602876* 
154540. 153900, 602890. 601153. 602190. 602905. 153634. 153337. 602914. 

152310. 151690. 151625. 602935. 602974. 150325, 602992. 150320. 250790. 603006. 603007. 603008. 150292 
233690. 603046, 150210. 603061, 147940, 603053. 221770, 223300. 603097. 147880, 603118. 14773o! 146928' 
146630. 142622.603149. 603150, 603151,600923. 138981. 138590. 138330.216950.603192. 138297*603202* 
601002, 602343, 138230. 136131. 603217. 603220. 134660. 131390, 131235, 603242, 603243, 13013o! 602345* 
126455. 601123. 126064. 125240, 602359. 603312. 602380. 603318. 123890. 123836, 603356, 603361 ! 603366* 
123830. 179610. 188060, 123620, 120980, 186355. 118510. 114835. 1 14217. 113810. 603499. 182310. 111740* 
109610. 603548. 603564, 108740, 603598. 603613. 107273. 603626. 602518. 179410, 603647,* 60251 5 603652' 
106195, 602573, 178990. 105210, 104615. 167055, 603717. 104614. 603728. 

1 0421 0, 603749. 603750, 1 03850, 602608. 603787. 603788. 603796. 'l 73445. 1 03220, 1 0291 0 1 02681 1 02670 
1 02642, 603833. 1 73391 , 1 02576, 1 02575. 1 71 833, 1 02573, 1 01 800, 603875. 6011 08 

[0049] -mere are several methods for analyzing the expression levels of genes associated with diseases. Differences 
in gene expression levels between diseased and normal tissues are studied by the analytical methods, for example 
Northern hybridization and differential display. Other examples include a method with high-density cDNA filter, a method 
with DNA microanray and methods with PGR amplification {Experimental Medicine. Vol.17. No. 8. 980-1056 (1999)- 
Cell Engineering (additional volume) DNA Microarray and Advanced PGR Methods, Muramatsu A Naba (eds.). Shu- 
junsya). TTie levels of gene expression between diseased tissues and normal tissues can be studied by any of these 
analytical methods. When explicit difference in expression level is observed for a gene, it can be concluded that the 
gene is closely associated with a disease or disorder. Instead of diseased tissues, cultured cells can be used for the 
assessment. Similarly, when gene expression is explicitly different between nonmal cells andcells reproducing disease- 
associated specific features, it can be concluded that the gene is closely associated with a disease or disorder. When 
the expression levels of genes are evidently varied during major cellular events (such as differentiation and apoptosis) 
the genes are involved in the cellular events and accordingly are candidates for disease- and/or disorder-associated 
genes- Further, genes exhibiting tissue-spedfic expression are genes playing important parts In the tissue functions 
and. therefore, can be candidates for genes associated with diseases and^or disorders affecting the tissues. 
[0050] For example, non-enzymic protein glycation reaction is believed to be a cause for a variety of chronic diabetic 
complications. Accordingly, genes, of which expression levels are elevated or decreased in a glycated protein-depend- 
ent manner, are associated with diabetic complications caused by glycated proteins (Diabetes 1996. 45 (SuppI 3) 
S67-S72; Diabetes 1997, 46 (Suppl. 2). S19-S25)- The onset of rheumatoid arthritis is thought to be involved in the 
proliferation of synovial cells covering inner surfaces of joint cavity and in inflammatory reaction resulted from the action 
of cytokines produced by leukocytes infiltrating into the joint synovial tissues (Rheumatism Information Genter. http:// 
www.rheuma-netorjpO- Recent studies have also revealed that tissue necrosis factor (TNF)-a participates in the onset 
(Current opinion in immunology 1999, 11. 657-662). When the expression of a gene exhibits responsiveness to the 
action of TNF on synovial cells, the gene is considered to be involved in rheumatoid arthritis. Genes associated with 
neural differentiation can be candidates for causative genes for neurological diseases as well as candidates for genes 
usable for treating the diseases. 

[0051] Clones exhibiting differences in the expression levels thereof can be selected by using gene expression anal- 
ysis. The selection comprises, for example; analyzing cDNA clones by using high-density cDNA filter and statistically 
treating the multiple signal values (signal values of radioisotope in the radiolabeled probes or values obtained by meas- 
uring fluorescence intensities emitted from the fluorescent labels) forthe respective clones by two-sample t-test, where 
the signal values are determined by multiple experiments of hybridizaUon. The clones of interest are selectable based 
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on the slalisically significant differences in the signal distn-bution at p<0.05. However, selectable clones with significant 

difference ,n the expression levelsthereof may bechangeddependingonthepartialmodificationof statistical tre 
For example, the clones may be selected by conducting statistical treatment with two-sample t-test at p<0 01 • or oenes 
e(h,b.t,ng more explicit differences in the expression levels thereof can be selected by perfomting statistical "treatment 
with a preKJetermined cut-olf value for the significant signal difference. An alterr.ative method is that the expression 
levels are s,mply compared with each other, and then, the clones of interest are selected based on the ratio of the 
expression levels thereof. 

[0052] Clonese^hibiting differences in the expression levels thereof can also be selected by comparing the expres- 
sion levels tv ^R analysis, for example, by using the method of detemtining the band intensities representing the 
amounts of PCR products with ethidium bromide staining; or the method of detemiining the values of Si^Sope 
f ""orescence intensities of the probes hybridized to the PCR products when radiolabeled or fluorescent dye- 
labeled probes, respectively, are used in the hybridization. If the expression level ratios obtained in multiple PCR 
ecpenments are constantly at least 2-fold, such a clone can be judged to exhibit the difference in the expressfon level 
the eo . When the rat«,s are several-fold or not less than 10-fold, the clone can be selected as a gene exhibiting the 
« explicit difference in the expression level thereof. ""ngine 
SriJ?/"T °' "^'"^ expression levels are varied in response to TMF a (Tumor Necrosis Factor-alpha) 

m the pnmary cellculture of synovial tissue delected the following clones with elevated expression levels in the presence 

^ PSEC0084. PSEC0100. PSEC0109, PSEC0120. PSEC0131. PSEC0161 PSEC0183 

pSS pSS: "s^'^^a^: pSS 

p'^^L^"^^ """y '"""^ "^'^ decreased expression levels in the presence of TNF a are PSEC0105 and 
HfaEC0245. These clones are candidates for rheumatoid arthritis-associated genes 

25 T!^^ A survey of genes of which expression levels are varied in response to the stimulation for inducing cell dif- 
f 1!? "^'"9 ^^""O*^ (RA)) in cultured cells of neural strain, NT2. detected the following clones 

with vaned expression levels: PSEC0005. PSEC0O48. PSEC0059. PSEC0200. and PSEC0232. These are important 
genes assoaated with neural differentiation. TTie following clones also had varied their expression levels- PSEC001 7 
PSEC0019. PSEC0021. PSEC0030. PSEC0041, PSEC0047. PSEC0049. PSEC0055. PSEC^" PSeS 
PSEC0071. PSEC0072. PSEC0074. PSEC0075. PSEC0076. PSEC0080. PSEC0081 PSECOO&S" PSeS 

30 PSE^094. PSEC0103. PSEC0104. PSEC0105. PSEC0112. PSEC0,13 PSEG0117 PSECo!^' pIIS 
p^pS^f • of ''SEC0143, PSEC0144. PSEC0152 PSEC0161 PSeS ^' 

PSEC0171, PSEC0181, PSEC0182. PSEC0192. PSEC0195. PSEC0203. PSEC0215 PSEC0223 PSeS2^' 
PSEC0239. PSEC0243.PSEC0251.PSEC0255.PSEC0265. 3. rotuv^j. Kt,bU)235. 

^^J^KJ^^!^""^^'^ associated with neural differenhation and. therefore, are candidates for genes associ- 
ated with neurological diseases. "^"^ 

po^^^Based on the functional analyses using a secretory protein or membrane protein, it is possible to develop a 

in case of a membrane protein, it is most likely to be a protein that functions as a receptor or llgand on the cell surface 
Tberefore. it is poss^e to reveal a new relationship between a ligand and receptor by screening the membrane protein 

totlIe'Zr,IleS°" 

f°^58I For acample. a ligand against the protein of the invention can be screened in the following manner NameV 

t.SniP vl^h .h' I° ^ l^'"" ^ "^^"^ ^ ^ '^'^'^ comprising the steps of: (a) contacting a tesi 

samp e «^th the protein of the invention ora partial peptide thereof, orcells expressing these, and t) selectir^ a test 
sample that binds to said protein, said partial peptide, or said cells. W 5«eaing a test 

pwsgj On the other hand, for example, screening using cells expressing the pretein of the present invention that is 
t^Tnr r P^f'^ed as follows. It is possible to screen receptors that is capable of binding to a 

specific protein by using procedures (a) attaching the sample cells to the protein of the invention or its partial p^tide 
and (b)selecung cells that can bind to the said protein or its partial peptide 

[0060] In a following screening as an example, first the protein of the invention is expressed, and the recombinant 
r.,mZ''^n H f IT"'^ "'"'""9 assay is perfbrmed using a various cell lines or primary 

^^1^ . f t^'^ expressing a receptor are selected (Growth and differentiation factors and their re- 

ceptors, Shin-Se.kagaku Jikken Kouza Vol.7 (1991) Honjyo, Aral. Taniguchi. and Muramatsu edit, p203-236 Tokyo- 
Kagaku-Doujm) A protein ol the invention can be labeled with Rl such as i25|. and enzyme (alkaline phosphatase 

againsttheprotein. The cells th^ 

1^!. r "'^'^ '°' '"^ '^"^^^ ^^^^"'"9 °' ^ agonists or antagonists of the said receptor 

(0061 J Once the ligand binding to the pretein of the invenOon. the receptor of the pretein of the invention or the cells 
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expressing the receptor are obtained by screening, it is possible to screen a compound that binds to the ligand and 
receptor. Also it is possible to screen a compound that can inhibit both bindings (agonists or antagonists of the receptor, 
for example) by utilizing the binding activities. 

[00621 When the protein of the invention is a receptor, the screening method comprises the steps of (a) contacting 
the protein of the invention or cells expressing the protein of the invention with the ligand, in the presence of a test 
sample, (b) detecting the binding activity betv^^een said protein or cells expressing said protein and the ligand, and (c) 
selecting a compound that reduces said binding activity when compared to the activity in the absence of the test sample. 
Furthenmore. when the protein of the invention is a ligand. the screening method comprises the steps of (a) contacting 
the protein of the invention with its receptor or cells expressing the receptor in the presence of samples, (b) detecting 
the binding activity between the protein and its receptor or the cells expressing the receptor, and (c) selecting a com- 
pound that can potentially reduce the binding activity compared to the activity in the absence of the sample. 
[0063] Samples to saeen include cell extracts, expressed products from a gene library, synthesized low molecular 
compound, synthesized peptide, and natural compounds, for example, but are not construed to be listed here. A com- 
pound that is isolated by the above screening using a binding activity of the protein of the invention can also be used 
as a sample. 

[0064] A compound isolated by the screening may be a candidate to be an agonist or an antagonist of the receptor 
of the protein. By utilizing an assay that monitors a change in the intracellular signaling such as phosphorylation that 
results from reduction of the binding between the protein and its receptor, it is possible to identify whether the obtained 
compound is an agonist or antagonist of the receptor. Also, the compound may be a candidate of a molecule that can 
inhibit the interaction between the protein and its associated proteins (including a receptor) in vivo. Such compounds 
can be used for developing dmgs for precaution or cures of a disease with which the protein is associated. 
[0065] Secretory proteins may regulate cellular conditions such as growth and differentiation. It is possible to find 
out a novel factor that regulates cellular conditions by adding the secretory protein of the invention to a certain kind of 
cell, and perfomning a screening by utilizing the cellular changes in growth or differentiation, or activation of a particular 
gene. 

[0066] The screening can be performed, for example, as follows. First, the protein of the invention is expressed and 
purified in a recombinant form. TTien, the purified protein is added to a various kind of cell lines or primary cultured 
cells, and the change in the cell growth and differentiation is monitored. The induction of a particular gene that is known 
to be involved in a certain cellular change is detected with the amounts of mRNA and protein. Alternatively, the amount 
of an intracellular molecule (low molecular compounds, etc.) that is changed by the function of a gene product (protein) 
that is known to be functioning in a certain cellular change is used for the detection. 

[0067] Once the screening reveals that the protein of the invenbon can regulate cellular conditions or the functions, 
it is possible to apply the protein as a pharmaceutical and diagnostic medicine for associated diseases by itself or by 
altering a part of it into an appropriate composition. 

[0068] As is above described for membrane proteins, the secretory protein provided by the invention may be used 
to explore a novel ligand-receptor interaction using a screening based on the binding activity to a known ligand or 
receptor. A similar method can be used to identify an agonist or antagonist. The resulting compounds obtained by the 
methods can be a candidate of a compound that can inhibit the interaction between the protein of the Invention and 
an interacting molecule (including a receptor). The compounds may be able to use as a preventive, therapeutic, and 
diagnostic medrcine for the diseases, in which the protein may play a certain role. 

[0069] If the protein or gene of the invention is associated with diseases, it is possible to screen a gene or compound 
that can regulate its expression antVor activity either directly or indirectly by utilizing the protein of the present invention. 
For example, the protein of the invention is expressed and purified as a recombinant protein. Then, the protein or gene 
that interacts with the protein of the invention is purified, and screened based on the binding. Alternatively, the screening 
can be performed by adding with a compound of a candidate of the Inhibitor added in advance and monitoring the 
change of binding activity. Tbe compound obtained by the screening can be used for developing phamiaceutical and 
diagnostic medicines for the diseases with which the protein of the present invention is associated. Similarty, if the 
regulatory factor obtained by the screening is a protein, the protein itself can be used as a pharmaceutical, and if there 
is a compound that affects the original expression level and/or activity of the protein, it also can be used for the same 
purpose. 

[0070] If the secrete or membrane protein of the present invention has an enzymatic activity, it is possible to identify 
the activity by adding a compound to the protein of the present invention under an appropriate condition, and monitoring 
the change of the compound. It is also possible to screen a compound that inhibits the activity of the protein of the 
invention by utilizing the activity as an index. 

[0071] In a screening given as an example, the protein of the invention is expressed and the recombinant protein is 
purified. Then, compounds are contacted with the purified protein, and the amount of the compound and the reaction 
products is examined. Alternatively, compounds that are candidates of an inhibitor are pretreated, then a compound 
(substrate) that can react with the purified protein is added, and the amount of the substrate and the reaction products 
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is examined. 

[0072] The compounds obtained in the screening may be used as a medicine for diseases with which the protein of 
the invention is associated. Also they can be applied for tests that examine whether the protein of the invention functions 
nonmally in wva 

[0073] Whether the secretory or membrane protein of the present invention is a novel protein associated with dis- 
eases or not is detennined in another method than described above, by obtaining a specific antibody against the protein 
of the invention, and examining the relationship between the expression or activity of the protein and a certain disease. 
In an alternative way, it may be analyzed referred to the methods in "Molecular Diagnosis of Genetic Diseases" (Elles 
R. edit, (1996) in the series of "Method in Molecular Biology" {Humana Press). 

[0074] The secrete or membrane protein of the present invention can be prepared as a recombinant protein or a 
natural protein. For example, a recombinant protein can be prepared by introducing a vector containing a DNA insert 
encoding the protein of the invention into an appropriate host cell, and purifying the expressed products from the 
transformant, as described below. On the other hand, a natural protein can be prepared , for example, by utilizing an 
affinity column which is bound with the antibody against the protein of the invention, as described below ("Cun^ent 
Protocols in Molecular Biology" Ausubel et al. edit. (1987) John Wily & Sons, Section 16.1 -16.19). The antibody used 
in the preparation of an affinity column can be a monoclonal antibody or polyclonal antibody. Alternatively, it is possible 
to prepare the protein of the invention by in vitro translation (See "On the fidelity of mRNA translation in the nuclease- 
treated rabbit reticulocyte lysate system." Dasso M.C.. and Jackson R.J. (1989) Nucleic Acids Res. 17:3129-3144). 
[0075] Proteins functionally equivalent to the proteins of the present invention can be prepared based on the activities, 
which were clarified in the above-mentioned manner, of the proteins of the present invention. Using the biological 
activity possessed by the protein of the invention as an index, it is possible to verify whether or not a particular protein 
is functionally equivalent to the protein of the invention by examining whether or not the protein has said activity. 
[0076] Proteins functionally equivalent to the proteins of the present invention can be prepared by those skilled in 
the art. for example, by using a method for introducing mutations into an amino acid sequence of a protein (for example, 
site-directed mutagenesis (Cunrenl Protocols in Molecular Biology, edit, Ausubel et al., (1987) John Wiley & Sons, 
Section 8.1-8.5), Besides, such proteins can be generated by spontaneous mutations. The present invention comprises 
the proteins having one or more amino acid substitutions, deletions, insertions and^or additions in the amino add 
sequences of the proteins of the present invention (Table 1), as far as the proteins have the equivalent functions to 
those of the proteins identified in the present Exanr^ples described later. 

[0077] There are no limitations in the number and sites of amino add mutations, as far as the proteins maintain the 
functions thereof. The number of mutations is typically 30% or less, or 20% or less, or 1 0% or less, preferably within 
5% or less, or 3% or less of the total amino adds, more preferably within 2% or less or 1 % or less of the total amino 
acids. From the viewpoint of maintaining the protein function, it is preferable that a substituted amino has a similar 
property to that of the original amino acid. For example, Ala, Val. Leu, He, Pro, Met, Phe and Trp are assumed to have 
similar properties to one another because they are all classified into a group of non-polar amino acids. Similarty, sub- 
stitution can be performed among non-charged amino acid such as Gly. Ser, Thr, Cys, Tyr, Asn, and Gin, acidic amino 
adds such as Asp and Glu, and basic amino acids such as Lys, Arg, and His. 

[0078] In addition, proteins functionally equivalent to the proteins of the present invention can l>e isolated by using 
techniques of hybridization or gene amplification known to those skilled in the art. Specifically, using the hybridization 
technique (Current Protocols in Molecular Biology, edit, Ausubel et al.. (1987) John Wiley & Sons. Section 6.3-6.4)). 
those skilled in the art can usuaBy isolate a DNA highly homologous to the DNA encoding the protein identified in the 
present Example based on the identified nucleotide sequence (Table 1) or a portion thereof and obtain the functionally 
equivalent protein from the isolated DNA. The present invention include proteins encoded by the DNAs hybridizing 
with the DNAs encoding the proteins identified in the present Example, as far as the proteins are functk)nally equivalent 
to the proteins identified in the present Example. Organisms from whch the functionally equivalent proteins are isolated 
are illustrated by vertebrates such as human, mouse, rat. rabbit, pig and bovine, but are not limited to these animals. 
[0079] Washing conditions of hybridization for the isolation of DNAs encoding the functionally equivalent proteins 
are usually "1 x SSC, 0.1% SDS, 37^0"; more stringent conditions are "0.5 x SSC, 0.1% SDS, 42*C"; and still more 
stringent conditions are "0.1 x SSC, 0.1% SDS, eS'^C*. Alternatively, the following conditions can be given as hybrid- 
ization conditions of the present invention. Namely, conditions in which the hybridizaUon is done at "6 x SSC. 40% 
Formamide, 25*C", and the washing at "1 x SSC, 55*0" can be given. More preferable conditions are those in which 
the hybridization is done at "6 x SSC, 40% Fonmamide, 3rC". and the washing at "0.2 x SSC, 55'C". Even more 
preferable are those in which the hybridization is done at "6 x SSC, 50% Formamide, 37*'C", and the washing at "0.1 
X SSC, 62**C". The more stringent the conditions of hybridization are, the more frequently the DNAs highly homologous 
to the probe sequence are isolated. Therefore, it is preferable to conduct hybridization under stringent conditions. 
Examples of stringent conditions in the present invention are, washing conditions of "0.5 x SSC. 0.1% SDS. 42'C*, 
or alternatively, hybridizaUon conditions of "6 x SSC. 40% Formamide, 37" C". and the washing at '0.2 x SSC. 55**^.' 
However, the above-mentionedcombinations of SSC, SDS and temperature conditions are indicated just as examples. 
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.Those skilled in the art can select the hybridization conditions with similar stringency to those mentioned above by 
property combining the above-mentioned or other factors (for example, probe concenlration. probe length and duration 
of hybridization reaction) that determines the stringency of hybridization. 

[0080] TTie amino acid sequences of proteins isolated by using the hybridization techniques usually exhibit high 
homology to those of the proteins of the present invention, which are shown in Table 1 . The present invention encom- 
passes a polynucleotide comprising a nucleotide sequence that has a high identity to the nucleotide sequence of claim 
1 (a). Furthennore, the present invention encompasses a peptide, or protein comprising an amino acid sequence that 
has a high idenUty to the amino acid sequence encoded by the polynucleotide of daim 1(b). The term "high identity- 
indicates sequence identity of at least 40% or more; preferably 60% or more; and more preferably 707o or more. 
Alternatively, more preferable is identity of 90% or more, or 93% or more, or 95% or more, furthemiore. 97% or more, 
or 99% or more. The identity can be determined by using the BLAST search algorithm. 

[0081] With the gene amplincaiion technique (PGR) (Cunrent Protocols in Molecular Biology, edit, Ausubel et aK, 
(1 987) John Wiley & Sons, Section 6.3-6.4)) using primers designed based on the DNA sequence (Table 1 ) or a portion 
thereof identified in the present Example, it is possible to isolate a DNA fragment highly homologous to the DNA 
sequence or a portion thereof and to obtain functionally equivalent protein to a particular protein identified in the present 
Example based on the isolated DNA fragment. 

[0082] The -percent identity" of two amino acid sequences or of two nucleic adds is detemriine using the algorithm 
of Kariin and Altschul (Proc. Natl. Acad. Sei. USA 87:2254-2268, 1 990), modified as in Karlin and Altschul (Proc. Natl. 
Acad. Sei. USA 90:5873-5877, 1993). Such an algorithm is incorporated into the BLASTN and BIASTK programs of 
Altschul et al. (J. Mol. Biol.2 15:403-410. 1 990), BUVSTnudeotide searches are performed with the BUVSTN program, 
score = 100, wordlength = 12. BLAST protein searches are performed with the BLASTX program, score = 50, word- 
length 3. When gaps exist between two sequences. Gapped BLAST is utilized as descrtoed in Altschul et al. (fviudeic 
Adds Res.25:3389-3402,1997). When utilizing BLAST and Gapped BLAST programs, the default parameters of the 
respective programs (e.g.. BUVSTX and BLASTN) are used. See httpy/www.ncbi.nlm.nih.gov. 
[0083] TTie present invention also includes a partial peptide of the proteins of the invention. The partial peptide 
comprises a protein generated as a result that a signal pepUde has been removed from a secretory protein. If the 
protein of the present invention has an activity as a receptor or a ligand, the partial peptide may function as a competitive 
inhibitor of the protein and may bind to the receptor (or ligand). In addition, the present invention comprises an antigen 
peptide for raising antibodies. For the peptides to be specific for the protein of the invention, the peptides comprise at 
least 7 amino acids, preferably 8 amino acids or more, more preferably 9 amino acids or more, and even more preferably 
10 amino adds or more. The peptide can be used for preparing antibodies against the protein of the invention, or 
compeUtrve inhibitors of them, and also screening for a receptor that binds to the protein of the invention. The partial 
peptides of the invention can be produced, for example, by genetic engineering methods, known methods for synthe- 
sizing peptides, or digesting the protein of the invention with an appropriate pepUdase. 

[0084] The present invention also relates to a polynucleotide encoding the protein of the invention. The polynudeotide 
of the invention can be provided in any fonm as far as it encodes the protein of the invention, and thus includes cDNA. 
genomic DNA. and chemically synthesized DNA, etc. The polynucleotide also indudes a DNA comprising any nude^ 
otide sequence that is obtained based on the degeneracy of the genetic code, as far as it encodes the protein of the 
invention. The polynucleotide of the invention can be isolated by the standard methods such as hybridization using a 
probe DNA comprising the nudeotide sequence set forth in odd SEQ ID NOs of SEQ ID NO: 1 to SEQ ID NO: 335, or 
the portions of them, or by PGR using primers that are synthesized based on the nudeotide sequence. 
[0085] For example, all the dones provided by the present invention, which were isolated in the example mentioned 
below, (1 73 dones) are novel and full-length, and encode a secretory protein or membrane protein. All the cDNA dones 
provided by the invention are characterized as follows, 

[0086] A full-length-enriched cDNA library that is obtained by the oligo-capping method, and selected based on the 
features of the 5'-end sequence: by the score in the ATGpr (or described as ATGprI ). which predicts the hjllness ratio 
of the S'-end. and by the PSORT. which predicts the presence of the signal sequence, as those containing the signal 
sequence in the 5'-end, or transmembrane region in the protein coding region. Furthermore, as a result of the homology 
search using the 5'-end sequences, the dones were found to be not identical to any of the known human mRNA 
(therefore to be novel). 

[0087] TTie present invention also relates to a vector into which the polynucleotide of the invention is inserted. The 
vector of the invention is not limited as long as it contains the inserted polynucleotide stably. For example, if E. coli is 
used as a host, vectors such as pBluescript vector (Stratagene) are preferable as a doning vector. To produce the 
protein of the invention, expression vectors are especially useful. Any expression vector can be used as far as it is 
capable of expressing the protein in vitro, in E. coli, in cultured cells, or in vivo. For example, pBEST vector (Promega) 
is preferable for in vitro expression. pET vector (Invitrogen) for E. coli. pME18S-FL3 vector (GenBank Accession No 
AB009864) for cultured cells, and pMEISS vector {mi Cell. Biol. (1 988) 8: 466-472) for in vivo expression. To insert 
the polynudeotide of the invention, ligation utilizing restriction sites can be performed according to the standard method 
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(Current Protocols in Molecular Biology (1987) Ausubel et al. edit, John Wily & Sons. Section 114-1111) 
LT^L, .Tl^'^^H '^'^'^ '° ^ <=a^i"9 '"e polynucleotide or the vector of the invention 

TJ^ . ^ '° '^'^ "'^ '"^ '"^"e*'' and various kinds of host cells can be 

used depending on the purposes. For strong expression of the protein in eukaiyotic cells. COS cells or CHO cells can 
be used, for example. ^ o v^n 

[0089] Introduction of the vector into host cells can be performed, for example, by cateium phosphate precipitation 

SJ'cf fo mT'rr'"' ''^""^'^ <1987) '^"^"bel et al edit. John Wily & Sons" 

Section 9.1-9.9). iipofectamtfie method (GIBCO-BRL). or microinjection method etc 

SL.^^ 'T"^^" "^'^'^ '° ^ oligonucleobde having a length of at least 15 nucleotides, comprising 
D NOs oT^mi^^^n^ r'.pn!n"r ' P"'^^^^ «'"'P^«"'9 ""cleotide sequence set fortZn oSd 
.11 °' ^ '° '^^^ °' complementary strand. The oligonucleotide of the present 

invemron hybndrzes with a polynucleoUde of odd SEQ ID NOs of SEQ ID NO: 1 to SEQ ID NO 3^ erlc^ZThe 
protein of the invention, or its complementary strand, under the standard conditions for hybridization or preferabV 
under stnngent condrhons. and in principle does not preferably hybridize with DNA encoding other proteins SucJ 
ol.gonucleot.de can be used as a probe for isolation and detection of the polynucleotide of the invention Snd as a 
^t^lZTJ^^T^^*"' polynucleotide of the present invention. As a primer, the DNA usually has a length of 15-100 

ITmT. , V!1"^T' "'^ °' ""^ •^^^ ^ '^"9* °' 15 bp. preferably 30 bp o^ more 

and more preferably 50 bp or more. 

[00911 Any sequence shown in SEQ ID NOs: 370-540 and that shown in SEQ ID NOs: 541-679 can be chosen as 

fr^X^lf "^""^"^ comprising the 5'-end primer and the 3'-end primer. respectiveV, to synthesize the full-length 

^It t^^r '":iT- "^""^ ""^^ "'^'^""'^ ^'-^ ""cleotide sequences hie 

already been known as EST sequences, the primers designed based on the present invention is novel in that thej 

hil! ITp^ EST sequences do not serve to design such prIrS 

Sfrh tlr."!."."^ f "^"^ '"^ "'^^^"f '"■'f'" corresponding cDNAs. 

[00921 Each of the full-length cDNAs of the present inventions can be synthesized by PCR (Current Protocols in 
Molecular B.ology. ed.. Ausubel el al.. (1987) John Wiley & Sons, Section 6.1-6.4) using a pal of prirnel S^J! 
?r„H ^--^^ '^^'"^ - --9 a primer pair consisting o?a primer seS ro^S 

5^nd sequences and a pnmer with oligo(dT) sequence complementary to the poly(A) sequence 

ELf If ^ ^""^^ """""^ ^ Oligonucleotide that has 15 nucleotides tonger, and specifkally 

the 5 LI 1 complementa^ strand of me poVnucleotide that conmins the nucleotide sequVnce selSted from 
\ T'^^*'""^ ^^"'^ 370-540). and an oligo-dT primer as a 5'-, and 3--primer 

IS descnbed below, the pnmer length of 25-35 bp may provide a good result 

A method to design a primer that enables a specific amplification based on the given nucleotide sequence is known 
to hose sk.lled.ntheart(CunentProtocolsin Molecular Biology.Ausubeletal.edi^^ 

6.1-6.4). In des.gn,ng a primer based on the S-end sequence, the primer is designed so as that, in principle the 

L """^"^'^'^^^ '^9'°" UTR)- any part of the sequence can be used as a S'-primer as far as the specificity 
nSiSo'^ " "^^ P^^^'^ "^'"9 a Jwn method s^h as 

[00941 When synthesizing a full-length cDNA. the target nucleotide sequence to be amplified can extend to several 
thousand bp .n some cDNA.However.il is possible toamplity such 

and Accurate PCR). It « advantageous to use LA PCR when synthesizing long DNA. In UV PCR. in which a speS 

PCrT^ ^2r.l!:?°TT'"'7 '"'''^'^ « "-"S "'«^«'«''« ^^--nce. By usin^'l^ 

SJI Mool .1^?''"^'°" °' ^ ""^'^""''^ ^ ^ achieved under desirable conc^tion 

(Takesh. Hayash. (1996) J.kken-lgaku Bessatsu. "Advanced Technologies in PCR" Youdo-sha) 
[M95] A template DNA for synthesizing the cDNA of the present Ihvention can be obtained by using cONA libraries 
that are prepared by vanous methods. T7,e fulHength cDNA ctones obtained here are those ^th high fullneS rat.? 
whch were ob,a.ned using a combination of (1) a method to prepare a full-length-enriched cDNA ll,ra7uTng me 

estima K)n by the ATGpr after removing clones that are non-full-length compared to the ESTs). However it is possible 

?erediSiir;rr^""^'"^^^^^^ 

SoH^^'T'" ''"!^"'' ''^'^"^ "^^ or commercially available is that mRNA 

r S T"'" " '""■'«"9'f' CDNA Clone directly Z 

■he library using ordinary clon.ng methods. The present invention has revealed a primer that is capable of synth^izTng 
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a full-length cDNA. If provided with primers, it is possible to synthesize a target full-length cDNA by using enzymatic 
reactions such as PGR. In particular, a full-length-enriched cDNA library, synthesized by methods such as oligoK:ap- 
ping. is desirable to synthesize a full-length cDNA with more reliability. 

[00971 Transcriptional regulatory regions including promoters in the genome can be isolated by uUIizing the S-end 
sequences of the full-length cDNA clones of the present invention. TTie rough draft (slightly inaccurate sequencing 
result obtained in the analysis of human genome) covering 90% or more of the entire human genome is expected to 
be achieved in the spring of 2000, and the entire analysis of human genome sequence is expected to be completed 
by 2003. Because of the presence of long introns. it is hard to determine the transcription initiation sites in human 
genome by using analytical software. TDe utilization of the 5'-end sequences of the fuH-length cDNA sequences of the 
present invention makes it easy to isolate promoter-containing genomic regions that are located upstream of transcrip- 
tion initiation sites and are involved in mRNA transcription regulation. TTiis is because the mRNA transcription initiation 
sites in the genome can be identified easily based on the 5'-end sequences of the full-length cDNAs. 
[0098] TTie polynucleotide of the present invention can be used for examination and diagnosis of the abnormality of 
the protein of the invention. For example, it is possible to examine the abnomnal expression of the gene encoding the 
protein using the polynucleotide of the invention as a probe for Northern hybridization or as a primer for RT-PCR. Also, 
the polynucleotide of the invention can be used as a primer for polymerase chain reaction (PGR) such as the genomic 
DNA-PGR. and RT-PCR to amplify the polynucleotide encoding the protein of the invention, or the regulatory region 
of the expression, with which it is possible to examine and diagnose the abnonnality of the sequence by RFLP analysis. 
SSGP, and direct sequencing, etc. 

[00991 Furthemnore. the "polynucleotide having a length of at least 1 5 nucleoUdes, comprising a nucleotide sequence 
that is complementary to a potynucleotide comprising the nucleotide sequence set forth in odd SEQ ID NOs of SEQ 
ID NO: 1 to SEQ ID NO: 335. or its complementary strand" includes an antisense polynucleotide for suppressing the 
expression of the protein of the invention. To exert the antisense effect, the antisense polynucleotide has a length of 
at least 15 bp or more, for example. 50 bp or more, preferably 1 00 bp or more, and more preferably 500 bp or more, 
and has a length of usually 3000 bp or less and preferably 2000 bp or less. The antisense DNA can be used in the 
gene therapy of the diseases that are caused by the abnonDality of the protein of ihe invention (abnonnal funcUon or 
abnomnal expression). Said antisense DNA can be prepared, for example, by the phosphorothioate method ("Physic- 
ochemical properties of phosphorothioate oligodeoxynucfeobdes.- Stein (1988) Nucleic Acids Res. 16: 320^3221) 
based on the nucleotide sequence of the DNA encoding the protein (for example, the DNA set forth in odd SEQ ID 
NOs of SEQ ID NO: 1 to SEQ ID NO: 335). 

[01 00] TTie polynucleotide or antisense DNA of the present invention can be used in gene therapy, for example, by 
administrating it into a patient by the in vivo or ex vivo method with vims vectors such as retrovinjs vectors, adenovirus 
vectors, and adeno-associaled vims vectors, or non-virus vectors such as liposome. 

[01 01 ] The present invention also relates to antibodies that bind to the protein of the invention. Jhere are no limitations 
in the fonn of the amibodies of the invention. They include polyclon al anUbodies, monoclonal antibodies, or their portions 
that can bind to the antigen. They also include antibodies of all classes. Furthermore, special antibodies such as 
humanized antibodies are also included. 

[0102] The polyclonal antibody of the invention can be obtained according to the standard method by synthesizing 
an oligopeptide conresponding to the amino add sequence and immunizing rabbits with the peptide (Gun-ent Protocols 
in Molecular Biology (1987) Ausubel et al. edit. John Wily & Sons. Section 11.12-11.13). The monoclonal antibody of 
the invention can be obtained according to the standard method by purifying the protein expressed in E. coli. immunizing 
mice with the protein, and producing a hybridoma cell by fusing the spleen cells and myeloma cells (Current Protocols 
in Molecular Biology (1987) Ausubel etal. edit. John Wily & Sons. Section 11.4-11.11). 

[0103] The antibody binding to the protein of the present invention can be used for purification of the protein of the 
invention, and also for detection and^or diagnosis of the abnonmalities of the expression and structure of the protein. 
Specifically, proteins can be extracted, for example, from tissues, blood, or ceils, and the protein of the invention is 
detected by Western blotting, immunoprecipitation, or ELISA. etc. for the above purpose. 

[01 04] Furthemnore. the antibody binding to the protein of the present invention can be utilized for treating the dis- 
eases that associates with the protein of the invention. If the antibodies are used for treating patients, human antibodies 
or humanized antibodies are preferable in terms of their low antigencity. The human antibodies can be prepared by 
immunizing a mouse whose immune system is replaced with that of hunrmn ("Functional transplant of megabase human 
immunoglobulin loci recapitulates human antitwdy response in mice" Mendez M J. et al. (1 997) Nat. Genet. 15:146-1 56. 
for a reference). TTie humanized antibodies can be prepared by recombination of the hypen/ariable region of a mon- 
oclonal antibody (Methods in Enzymology (1991) 203: 99-121). 

[0105] Tlie present invention further relates to databases comprising at least a sequence of polynucleotides and/or 
protein, or a medium recorded in such databases, selected from the sequence data of the nucleotide and/or the amino 
acids indicated in Table 1 . The term "database" means a set of accumulated infomnation as machine-searchable and 
readable infonnation of nucleotide sequence. TTie databases of the present invention comprise at least one of the 
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novel nudeotide sequences of polynucleotides provided by the present invention. The databases of the present inven- 
tion can consist of only the sequence data of the novel polynucleotides provided by the present invention or can com- 
pnse other information on nucleotide sequences of known full-length CDNAs or ESTs. The databases of the present 
invention can be comprised of not only the infomialion on the nucleotide sequences but also the information on the 
gene functions revealed by the present invention. Additional information such as names of DNA clones carrying the 
full-length cDNAs can be recorded or linked together with the sequence data in the databases. 
[01061 Ttie database of the present invention is useful for gaining complete gene sequence information from partial 
sequence information of a gene of interest. TTie database of the present invention conprises nucleotide sequence 
infomiation of full-length cDNAs. Consequently, by comparing the infonmation in this database with the nucleotide 
sequence of a partial gene fragment yielded by differential display method or subtraction method, the infomiation on 
the full-length nucleotide sequence of interest can be gained from the sequence of the partial fragment as a starting due 
[0107] The sequence infomiation of the full-length cDN As constituting the database of the present invention contains 
not only the information on the complete sequences but also, extra information on expression frequency of the genes 
as well as homology of the genes to known genes and known proteins. Thus the extra infomiation facilitates rapid 
functional analyses of partial gene fragments. Further, the infomiation on human genes is accumulated in the database 
of the present invention, and therefore, the database is useful for isolating a human homologue of a gene originating 
from other species. The human homologue can be isolated based on the nucleotide sequence of the gene from the 
original species. 

[01081 At present, infonmation on a wide variety of gene fragments can be obtained by differential display method 
and subtraction method. In general, these gene fragments are utilized as tools for isolating the full-length sequences 
thereof. When the gene fragment conresponds to an already-known gene, the full-length sequence is easily obtained 
by comparing the partial sequence with the infomnation in known databases. However, when there exists no information 
corresponding to the partial sequence of interest in the known databases. cDNA cloning should be carried out for the 
full-length CDNA. It is often diffk;ull to obtain the full-length nucleotide sequence using the pamal sequence information 
as an initial clue, if the fulMength of the gene is not available, the amino acid sequence of the protein encoded by the 
gene remains unidentified. Thus the database of the present invention can contp-bute to the identification of full-length 
cDNAs conresponding to gene fragments, which cannot be revealed by using databases of known genes. Tlie present 
invention has provided 173 proteins that are novel secretory proteins or membrane proteins, and fulMength cDNA 
clones encoding the proteins. It has great signincance to provide a novel full-length cDNA clone of humans as only 
few a of which have been Elated. It was found that the secretory proteins and membrane proteins of the present 
invention are associated with many diseases. Those genes and proteins associated with diseases are useful for de- 
veloping medcines as they can be used as a diagnostic marker, or a target for gene therapy or developing medfcines 
that IS capable of regulating their expression and activity. Especially, the cDNA clones encoding a secretory protein 
are extremely important for medicinal industry since the protein itself is expected to be effective as a medicine and 
also the gene may have potential to be associated with many diseases. Moreover, those proteins such as membrane 
proteins and the genes encoding the proteins may be used as a disease marker. These cDNA clones are also important 
for medicinal industry as they may be effective for treating diseases through the regulation of the expression and activity 
of their encoded proteins. 

[01 09] The invention is illustrated more specifically with reference to the following examples, but is not to be constmed 
as being limited thereto. 

EXAMPLE 1 

Construction of a cDNA library by the oligo-capping method. 

[0110] The NT-2 neuron progenitor cells (Stratagene). a teratocarcinoma cell line from human embryo testis which 
can differentiate into neurons by treatment with reiinoic acid were used. The NT-2 cells were cultured according to the 
manufacturer's instructions as follows. 

(1) NT-2 cells were cultured without induction by retinoic acid treatment (NT2RM1). 

(2) Alter cultured, NT-2 cells were induced by adding retinoic acid, and then were cultured for 48 hours (NT2RP1 ). 

(3) After cultured, NT-2 cells were induced by adding retinoic acid, and then were cultured for 2 weeks (NT2RP2). 

[0111] The cells were harvested separately, from which mRN A was extracted by the method described in the literature 
(Molecular Cloning 2nd edition. Sambrook J., Fritsch. E.F., and Maniatis T. (1989) Cold Spring Harbor Laboratory 
Press). Furthermore. poly(A)+RNA was purified from the mRNA using oligoKJT cellulose. 

Similarly, human placenta tissues (PLACE1), human ovary cancer tissues (OVARCl). and human embryo-derived 
tissues that were enriched with brain (HEMBA1 ) were used to extract mRNA by the method described in the literature 
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(Molecular Cloning 2nd edition. Sambrook J., Frilsch, E.F., and Manialis T. (1989) Cold Spring Harbor Laboratory 
Press). Furthenmore, poly(A)+RNA was purified from the mRNA using oligo-dT cellulose. 

[0112] Each pofy(A)+RNA was used to construct a cDNA library by the oligo-capping method (Maruyama M. and 
Sugano S. (1 994) Gene 1 38: 1 71 -1 74). Using the OligcKap linker (SEQ ID NO: 337) and the Oligo-dT primer (SEQ 
ID NO: 338), BAP (bacterial alkaline phosphatase) treatment, TAP (tobacco acid phosphatase) treatment. RNA ligation, 
the first strand cDNA synthesis, and removal of RNA were pertomied as described in the reference (Suzuki and Kanno 
(1996) Protein Nucleic ackJ and- Enzyme. 41: 197-201; Suzuki Y. et al, (1997) Gene 200: 149-156). Next, 5'- and3'- 
PCR primers (SEQ ID NO: 339, and 340, respectively) were used for perfomriing PGR (polymerase chain reaction) to 
convert the cDNA into double stranded cDNA. which was then digested with SfiL Then, the Dralll-cleaved pUC19FL3 
vector (Figure 1; for NT2RM1. and NT2RP1). or the Dralll-deaved pME18SFL3 (Figure 1) (GenBank AB009864. ex- 
pression vector: for NT2RP2. NT2RP3. Pl_ACE1 . 0VARC1 . and HEMBA1) was used for cloning the cDNA in an'uni- 
directional manner, and cDNA libraries were obtained. The clones having an insert cDNA with a length of 1 kb or less 
were discarded from NT2RM1 . NT2RP1 , NT2RP2, Pl^CEl , 0VARC1 , and HEMBAI , and the clones having an insert 
cDNA with a length of 2 kb or less were discarded from NT2RP3. Then, the nucleotide sequence of the 5'- and 3'- ends 
of the cDNActones was analyzed with a DNA sequencer (ABI PRISM 377, PE Biosystems) after sequencing reactions 
were perfonmed with the DNA sequencing reagents (DyeTenminalor Cyde Sequencing FS Ready Reaction Kit, dRhod- 
amine Temninator Cycle Sequencing FS Ready Reaction Kit, or BigDye Terminator Cycle Sequencing FS Ready Re- 
action Kit, from by PE Biosystems) according to the instructions. 

[0113] Ihe so analyzed 5'-end and 3'-end nucleotide sequences of the clones are shown in SEQ ID NOs: 370-540 
and in SEQ ID NOs: 541 -679, respectively. The SEQ IDs and the corresponding PSEC clones are as indicated in Table 
342. 

[0114] The cDNA libraries' of NT2RP2 and HEMBAI were constructed using eukaryotic expression vector 
pMEl 8SFL3. The vector contains SRa promoter and SV40 small t intron in the upstream of the cloning site, and SV40 
polyA added signal sequence site in the downstream. As the cloning site of pMEl 8SFL3 has asymmetrical Drain sites, 
and the ends of cDNA fragments contain Sfil sites complementary to the Dralll sites, the cloned cDNA fragments can 
be inserted into the downstream of the SRa promoter unidirectionally. Therefore, clones containing full-length cDNA 
can be expressed transiently by introducing the obtained plasmid directly into COS cells. Thus, the ctones can be 
analyzed very easily in ierws of the proteins that are the gene products of the clones, or in terms of the biologk:aJ 
activities of the proteins. 

[01 1 5] The fullness ratio at the 5*-end sequences of the cDNA clones in the Itoraries constructed by the oltgoKjapping 
method was detemnined as follows. Of all the clones whose S'-end sequences were found in those of known human 
mRNA in the public database, a clone was judged to be "full-length", if it had a longer 5'-end sequence than that of the 
known human mRNA, or, even though the 5'-end sequence was shorter, it it contained the translation initiation codon. 
A clone that dki not contain the translation initiation codon was judged to be "non-full-length". The fullness ratio ((the 
number of fulMength clones)/(the number of full-length and non-full-length clones)) at the 5'-end of the cDNA clones 
from each library was detenmined by comparing with the known human mRNA (NT2RM1: 69%; NT2RP1: 75%; 
NT2RP2: 62%; NT2RP3: 61%; PUVCE1: 68%; OVARCI: 59%; and HEMBAI: 53%). Tlie result indicates that the 
fullness ratio at the 5'-end sequence was extren>ely high. 

[01 16] The relationship between the cDNA libraries and the clones is shown below. 

NT2RM1 : PSEC0001-PSEC0017 
NT2RP1 : PSEC0019-PSEC0047 
NT2RP2 : PSEC0048-PSEC0085, 

PSEC0092-PSEC0109. 

PSEC0111-PSEC0113, PSEC0173 
NTi2RP3 : PSEC0241-PSEC0265 
PLACE1 : PSEC0086-PSEC0090, PSEC0110, 

PSEC0117-PSEC0172 
0VARC1 : PSEC0178-PSEC0183, PSEC0239-PSEC0240 
HEMBAI : PSECOl 90-PSEC0237 



EXAMPLE 2 



Estimation of the fullness ratio at the 5-end of the cDNA by the ATGpr and the ESTiMateFL. 

[01 17] The ATGpr, developed by Salamov A.A.. Nishikawa T, and Swindells M.B. in the Helix Research Institute, is 
a program for prediction of the translation initiation codon based on the characteristics of the sequences in the vicinity 
of the ATG codon [A. A. Salamov. T Nishikawa, M. B. Swindells. Bioinformatics, 14: 384-390 (1998); httpy/www.hri. 
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^TTn^]^a.l^ '^iso described as ATGpri below) that an ATG is a true initiation 

K : t. K i ^"^'^"^ '"^ ^PP"^"^ '° sequences of the clones from the cONA library that 

was obtained by the oligo-capping method and that had 65% fullness ratio, the sensitivity and specificity of estimation 
of a full-tength clone (done containing the N-terminal end of ORI^ were improved to 82-83% by selecting only dones 
havmg the ATGpM score 0.6 or higher. Furthemiore. the 1 7.365 clones in which the 5'-end sequence is identical to a 
known human mRNA and which were cloned from the human cDNA libraries constmded by the oligo-cappinq method 
were estimated by the program. Briefly, the maximal ATGpri score of the dones was detemiined, and then their 5'- 
end sequence was compared with the known human mRNA to esUmate whether the done is lull-length or not The 
result wassummarized in Table 2. It is indicated that the method for the selection through the combination of the ATGor 
and the dones isolated from the human cDNA library that was constructed by the oligo-capping method was vely 



maximal ATGpn Score 


number of full-length and 
not-full-length clones 


number of full-length clones 


fullness ratio 


>-0.70 
>=0.50 
>=0.30 
>=0.17 
>=0.05 


10,226 
12,171 
14,102 
15.647 
17,365 


8,428 
9,422 
10,054 
10,385 
10,608 


82.4% 
77.4% 
71.3% 
66-4% 
61.1% 


numoer or ruiHengtn clones, the number of the clones which contain the N-term 
of not-full-length clones, number of the clones which does not contain the N-terminus o 
resulting number of (the number of full-length clonesy(the number of full-length and n 


nus of the ORF; the number 
f the ORF; fullness ratio, the 
ot-fuil-length clones) 



[01181 The ESTlMateFL. developed by Nishikawa and Ota in the Helix Research Institute, is a method for the se- 
S"as°e ^ '^""^ ^"'^^ '"""^ ^ ""^""9 Of 3"-e"d sequences of ESTs in the pubfc 

[01 19] ^ the method, a cDNA clone is judged presumably not to be full-length if there are any ESTs that have tonqer 
fnZZ VT^T"^^ '"^ '"^"''^ systematized for high throughput analysis. A done is judSl 

to be fun-length .f the done has a longer 5'-end sequence than ESTs in the public database. Even if a clone h^ 

nnM K .Tf ' K?"^ ^""^^ '"""'^"^''^ " '^^ '^'"'^'^^^ length is within 50 bases, and otherwise judged 
not tobe full-length, for convenience. Theprecision of the prediction by comparing cDNA clones with ESTs is improved 
w.m mcreasing nurnber of ESTs to be compared. However, when only a limited number of ESTs are available the 
,s^^^,S^"'f^ ^"^"^^ clones with high probability of being not-fulHength 

^ T,,^ f T !! by the oligoHcapping method and that have the S'-end sequences with atwut 

of d3NA of a human unknown mRNA that has a significant number of ESTs in the public database 
[0120] TTie results were summarized in Tables 3 and 4. It was confirmed that, in estimating the fullness ratio at the 
5 -endsequence of theclonesof the human cDNA library that was constructed by the oligo-capping method thefullness 
ratio was improved even for the dones having low score in the ATGpr by combining the ATGprand ESTll^ateR. The 
result was applied to the estimation of the fullness ratio at the 5'-end sequence of the clones whose complete cDNA 
sequences were detemiined. TDe number of full-length clones, the number of not-full-length dones. and the fullness 
ratio indicate the number of the dones which contain the N-tem,inus of the ORF. the number of the dones whid> does 

I!!! .r„r T.T"^ ^"'^ "'^ '^""■■"9 """iberof (the number of full-length donesy(the number of 

full-length and not-full-length dones), respectively. 



Table 3 



The fullness ratio at th 
the ORF of the known hun 
evaluated by comparing th 


e 5 -end sequence of the cDNA clones that were judged to be full-length by comparing 
nan mRNA and that were obtained by the oligo-capping method, wherein the ratio was 
e cDN A clones with ESTs. 


maximal ATGpri Score 


number of full-length 
clones 


number of nol-fuil-length 
clones 


fullness ratio 


>=0.30 

>=ai7 

>-0.05 


8,646 
10,158 
15.351 


907 
1,150 
2.728 


90.5% 
89.8% 
84.9% 
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Table 4 



TTie fullness ratio ar the 5'-end sequence of the cDNA clones that were judged to be not-full-length by comparing 
the ORF of the known human mRNA and that were obtained by the oligo-capping method, wherein the ratio was 
evaluated by comparing the cDNA clones with ESTs. 


maximal ATGprI Score 


number of full-length 
clones 


number of nol-full-length 
clones 


fullness ratio 


>=0.30 
>=0.17 
>=0.05 


1,271 
1.678 
2.512 


2.156 
2.907 
4.529 


37.1% 
36.6% 
35.7% 



EXAMPLE 3 



Selection of the clones containing the signal sequence and the full-length-enriched clones. 

[01211 From the clones in each library constructed by the oligoK^apping method, those clones predicted to contain 
the signal sequence (most likely to be a secretory protein or membrane protein) were specifically selected by analyzing 
the amino acid sequence that are predicted by all the ATG codons within the 5'-end sequence, for the presence of the 
signal peptide, which is characteristk; in the N-terminus of many secretory proteins, by using the PSORT. devetoped 
by Nakai and Kanehisa, which predicts the localization of a protein. 

[0122] PSEC0001-PSECOa56 were not selected by the ATGpr score of the 5'-end sequence (one pass sequencing) 
but selected by the presence of both the signal sequence (analyzed by the PSORT). and the ORF (Open reading 
frame; a region translated to be amino acids) in the 5'-end sequence. PSEC0068-PSEC0265 were selected as those 
having the maximal ATGprI score of the 5'-end sequence (one pass sequencing) 0.7 or higher, in which both the signal 
sequence (analyzed by the PSORT) and the ORF exist in the S'^end sequence. 

EXAMPLE 4 



Analysis of the complete cDNA sequence and classification by categories. 

[0123] For the 173 clones selected In Example 3, the nucleotide sequences of the full-length cDNA and the deduced 
ammo acid sequences were detenmined. The nucleotide sequences were finally determined by overlapping completely 
the partial nucleotide sequences determined by the following three methods. The amino acid sequences were deduced 
from the detenmined cDNA sequences. The results were shown in SEQUENCE LISTING (Only the results of the 173 
clones that were classified into a secretory protein or membrane protein were shown). 

(1) Long-read sequencing from both ends of the cDNA inserts using a Licor DNA sequencer (After sequence 
reactions v^re perfomned according to the manual for the Licor sequencer (Aroka). DNA sequence was determined 
by the sequencer.) 

(2) Nested sequencing by the Primer Island method which utilizes the in vitro transfer of AT2 transposon (Devine 
S.E.. and Boeke J.D. (i 994) Nucleic Acids Res. 22: 3765-3772) (After clones were obtained using a kit from PE 
Biosystems. sequence reactions were performed using the DNA sequencing reagents from the company, according 
to the manufacturer's Instructions, and DNA sequence was detenmined using an ABI PRISM 377 sequencer.) 

(3) Primer walking by the dideoxy tenminator method using custom synthesized DNA primers (After sequence 
reactions were perfooned using the DNA sequencing reagents from PE Biosystems and custom synthesized DNA 
pnmers according to the manufacturer's instmctions. DNA sequence was determined using an ABI PRISM 377 
sequencer). 

[01 24] These sequences were subjected to the analysis by the ATGpr and PSORT and also to the 8UVST search 
of the GenBank and SwissProt. As a result, most clones (152 clones out of 173) were predicted to be a secretory 
protein or membrane protein that contains a signal sequence in the N-terminus. Furthermore, those clones in which 
a s^nal sequence was not found by the PSORT. (PSEC0027. PSEC0047. PSEC0066. nnnnnnnn. PSEC0069 
PSECO092. PSEC0103. PSEC0117. PSEC0142. PSEC0212. PSEC0239. PSEG0242. PSEC0251 PSECOSSe' 
PSEa)006. PSEC0043, PSEC0058. PSEC01 95. PSEC0206. and PSEC0211 ) were subjected to the analysis by the 
MEMSAT and SOSUI for the identity as a membrane protein (containing the transmembrane helix). As a result 14 
clones among the 20 clones were predicted to contain the transmembrane helix (PSEC0027. PSEG0047. PSEC0066, 
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nnnnnnnn, PSEC0069, PSEC0092, PSEC0103, PSEC0117. PSEC0142, PSEC0212. PSEC0239, PSEC0242, 
PSEC0251 . and PSEC0256). THus. the clones were predicted to be a membrane protein. As a result of the homotogy 
search of the SwissProt. PSEC01 95 and PSEC0206 were found to have relatively high homology with mouse plasma 
membrane adapter HA2/AP2 adaptin a C subunil. and human cart)oxypeptidase H precursor (prohormone processing 
car1x>xypeplidase) in the secretory granule, respectively. 

[0125] The above results were shown in List i , List 2. and List 3. Therein, the function of each cDNA clone (annotation) 
was shown as well. The categories of the 168 clones out of 1 73 clones were shown in the foliowings. 

1 . Clones that are predicted to be a full-length cDNA clone encoding a secretory protein or membrane protein (1 68 
clones) ^ 
(Most clones have the ATGpri score 0.5 or higher). 

1 ) Clones that are predicted to be a full-length cDN A clone encoding a secretory protein or membrane protein, 
in which a signal sequence is present in the N-terminus (152 clones, List 1). PSEC0001 PSEC0049PSEC0085 
PSEC0113 

nnnnnnnn PSEC0051 PSEC0086 PSEC0119 
PSEC0005 PSEC0052 PSEC0087 PSEC0120 

PSEC0007 PSEC0053 PSEC0088 PSECOl 2 1 

PSEC0008 PSEC0055 PSEC0090 PSECOl 24 

PSEC0012 PSEC0059 PSEC0094 PSECOl 25 

PSEC0017 PSEC0061 PSECO095 PSECOl 26 

PSEC0019 PSEC0068 PSEC0098 PSEC0127 

PSEC0020 PSEC0070 PSEC0099 PSECOl 28 

PSEC0021 PSEC0071 PSECOl 00 PSECOl 29 

PSEC0028 PSEC0072 PSECOl 01 PSECOl 30 

PSEC0029 PSEC0073 PSECOKM PSEC0131 

PSEC0030 PSEC0074 PSEC0105 PSEC01 33 

PSEC0031 PSEC0075 PSEC0106 PSECOl 34 

PSEC0035 PSEC0076 PSECOl 07 PSECOl 35 

PSEC0038 PSEC0077 PSECOl 08 PSECOl 36 

PSEC0040 PSEC0079 PSECOl 09 PSECOl 37 

PSEC0041 PSEC0080 PSEC0110 PSECOl 39 

PSEC0045 PSEC0081 PSEC0111 PSEC0143 

PSEC0048 PSEC0082 PSEC0112 PSEC0144 

nnnnnnnn PSECOl 78 PSEC0216 PSEC0247 

PSEC0147 PSEC0181 PSEC021 8 PSEC0248 

PSECOl 49 PSECOl 82 PSEC0220 PSEC0249 

PSEC0150 PSECOl 83 PSEC0222 PSEC0250 

PSEC0151 PSECOl 90 PSEC0223 PSEC0252 

PSECOl 52 PSECOl 91 PSEC0224 PSEC0253 

PSEC0158 PSEC0192 PSEC0226 PSEC0255 

PSECOl 59 PSECOl 97 PSEC0227 PSEC0258 

PSEC0161 PSEC0198 PSEC0228 PSEC0259 
PSEC0152 PSEC0t99 PSEC0230 PSEC0260 
PSECOl 63 PSEC0200 PSEC0232 PSEC0261 
PSEC0164 PSEC0203 PSEC0233 PSEC0263 
PSEC0165 PSEC0204 PSEC0235 
PSECOl 67 PSEC0205 PSEC0236 
PSEC0168 PSEC0207 PSEC0240 
PSEC0169 PSEC0209 PSEC0241 
PSECOl 70 PSEC0210 PSEC0243 
PSECOl 71 PSEC0213 PSEC0244 
PSECOl 72 PSEC0214 PSEC0245 
PSECOl 73 PSEC0215 PSEC0246 
(Annotation 1 ) 

Clones that have the ATGpri score 0.5 or lower (PSEC0017, ATGpn 0.33; PSEC0030 ATGpn 026- 
PSEC0031 . ATGpri 0.20; PSEC0049. ATGpri 0.35): These clones, in which data of the 5'-end sequence (one 
pass sequencing) was not sorted by the ATGpr, were selected as a clone having both the signal sequence 
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and long ORF based on the data of the 5'-end sequence, and the sequence of their full-length cDNA clones 

was determined. All the clones have a signal sequence in the N-temiinus. In addition, the above 4 clones 

except PSEC0049 have longer 5'-end compared to the corresponding EST PSEC0D49 has an ORF that has 

longer S'-end than that of EST. Thus, these clones turned out to be full-length cDNA clones. 

2) Clones that are predicted to be a full-length cDNA encoding a secretory protein or membrane protein, in 

which the signal sequence is not present in the N-tenninus, and predicted to be a membrane protein (14 

clones, List 2). 

PSEC0027 

PSEC0047 

PSEC0066 

nnnnnnnn 

PSEC0069 

PSEC0092 

PSEC0103 

PSEC0117 

PSEG0142 

PSEC0212 

PSEC0239 

PSEC0242 

PSEC0251 

PSEC0256 

(Annotation 3) 

Clones that have the ATGpn scoreO.5 orlower(PSEC0239. ATGpri 0.18); PSEC0239 was selected 
as a crone having high ATGpri score of the 5*-end sequence (one pass sequencing), in which the signal 
sequence was predicted to be present. Although this done was predicted to be without the signal sequence 
in the N-terminus according to the predicted ORF after complete sequencing, the clone was predicted to 
be a membrane protein (having the transmembrane helix) by the MEMSAT and SOSUI. In addition, the 
clone was found to contain a longer 5*-sequence than ESTs by comparing with them. 
(Annotation 4) 

PSEC0242 and PSEC0251 : Both clones are classified into the cDNA encoding the polypeptide 'con- 
taining a signal sequence in the N-terminus". if translation starts from their third ATG codon. 
PSEC0242: No.3 ATG, ATGpri 0.82. SP-Yes. ORF 171-1343 391 aa. Signal peptide 24; 
PSEC0251: No.3 ATG. ATGpri 0.77, SP-Yes. ORF 116-1256 380 aa, Signal peptide 28.' 

2. Clones that are predicted to be neither of a secretory protein or membrane protein by the PSORT. MEMSAT. 
and SOSUl. but predicted to be full-length by the ATGpr. which were isolated from the full-length-enriched humarl 
cDNA libraries constructed by the oligoK^apping method (2 clones) 

(Both clones have the ATGpr score 0.5 or higher). 

PSEC01 95. and PSEC0206. 

[0126] According to the result of the homology search of the SwissProt. PSEC0195 and PSEC0206 were found to 
have relatively high homology with mouse plasma menrrbrane adapter HA2/AP2 adaptin a C subunil. and human car- 
boxypeptidase H precursor (prohonmone processing carboxypeptidase) in the secretory granule, r^peclively. Thus, 
the proteins are classified into the category of "a secretory protein or membrane protein" (see IJst3). 

EXAMPLE 5 

Selection of clones predicted to have signal sequences 

[0127] Specific selection was carried out for clones predicted to have signal sequences (having high probability of 
being seaetory and/or membrane proteins) by testing the presence of a sequence predicted as a characteristic signal 
peptide found in amino-terminal sequences of many secretory proteins. The selection was pertormed by surveying all 
the possible amino acid sequences that are initialed with distinct ATG codons located in the 5'-end sequence and that 
are encoded by a cDNA isolated from each library prepared by oligoncapping method, by using a computer program, 
"PSORr developed for predicting domain localization in a protein by Nakai and Kanehisa. Specifically, based on the 
5'-end sequence data (one pass sequencing), the clones were selected under the conditions that the signal sequence 
(analyzed by PSORT) had a maximal ATGpri value of 0.7 or higher and the corresponding ORF was found in the 5'- 
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end sequence. 

[0128] The correspondence between the clones and the cDNA libraries is as follows- 
NT2RP2: PSEC0078. PSEC0084 
NT2RP3; PSEC0264, PSEC0265 
5 HEMBA1:PSEC0237 



EXAMPLE 6 
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Sequencing of the full-length cDNAs and categorization thereof 

[0129] Nucleotide sequences were determined lor the 5 (ull-length cDNAs selected In Example 5 by assembling the 
sequence data derived from both strands. Amino acid sequences were then deduced from the full-length nucleotide 
sequences. The sequences were subjected to the analyses with ATGpr and PSORT programs. Furthemiore, databases 
such as GenBank and SwissProt were searched for the full-length sequences by BLAST There were 4 clones 
(PSEC0084.PSEC0237.PSEC0264.andPSEC0265) that were predictedtoencode'secretoryprotJnsT^^^^^^ 
sequences at ihe.r N-temnini. As for another clone (PSEC0078). no signal sequence was detected in the deduced 
ammo aad sequence thereof by PSORT. By using IWEMSAT and SOSUI programs, this clone was further analyzed to 
assess whether or not the protein encoded by this clone was a membrane protein (having a transmembrane helix) 
The result showed that a transmembrane helix was predicted to be present in this protein. In other words the protein 
was presumed to be a membrane protein. ' 
[01301 From the matching data obtained by BLAST analysis, matching data including information on proteins whose 
unctions were relatively easy to be predioled w^re chosen to present them herein. Some clones were however, se- 
le«ed simply because of the high homology in the matching data. These results are shown in List 1 and List 2 tooe'ther 
with the annotation of the function of each cDNA clone. The categorization of the 5 clones is described below 
[0131] Results obtained by BLAST analysis are presented herein for the above-mentioned clones other than the 5 
Clones based on the same criterion as mentioned above for the selection 

^^^"^^ '° '"'Hengf CDNA sequences and to encode secretory and/or membrane proteins (5 

clones predicted to cover the full-length cDNA sequences and to encode secretory and/or membrane proteins with 
signal sequences at the N-temninal ends thereof (4 clones) (List 1) (ATGprI value is 0 5 or higher) 
PSEC0084, PSEC0237, PSEC0264, PSEC0265 

a clotie predicted to cover the fulMength cDNA sequence and to encode secretory and/or membrane protein without 
signal sequence at the N-terminal end thereof (1 clones) (List 2) PSEC0078 

(Annotation) The ATGpn value was 0.24. This is a clone exhibiting high ATGprI value and selected as having a signal 
sequence ,n the predidion based on the S'-end sequence data (one pass sequencing). However, based on the ORF 
deduced from the full-length sequence determined later, this clone has been finally judged not to have the signal 
sequence at the N-tenminus thereoL Nonetheless, theclone has been predicted to encode a membrane protein (having 
a transmembrane helix) by IWEMSAT and SOSUI analyses. In addition, in comparison with EST sequerices the cDNA 
sequence was not found to be 50 bp or more shorter than any EST sequence at their 5"-end. and therefore the done 
was not judged to be a incomplete cDNA clone by using ESTs as criteria for the judgment. 

EXAMPLE 7 
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Gene expression analysis with hybridization using high density DNA filter 

[0132] Nylon membrane for DNA spotting was prepared according to the following procedure E coli was cultured 
in each well of a 96-well plate (in a LB medium at 37'C for 16 hours). A sample of each culture was suspended in 10 
Hi of stenle water in a well of a 96-well plate. T7.e plate was heated at 100-C for 10 minutes. Then, the boiled samples 
were anaVzed by PGR. PGR was perfomied in a 20 solution by using TaKaRa PGR Amplification Kit (Takara) ac- 
cording to the supplier's protocoL Primers used for the amplification of an insert cDNA in a plasmid were a pair of 
oXTmCr.'^' 1^^'"^ '^«^9'9«^«'«9<= 3-) and ME1250RV (5' tgtgggaggttttttctcta 3-), or a pair of 
pnr^ers M13M4 (5 gttttoccagtcacgac 3") and M13RV (5' caggaaacagctatgac 3-). PGR was performed using a thermal 
cyder, GeneAmp System 9600 (PE Biosystems) at 95-C for 5 minutes; at 95°C for 10 seconds and at 68-C for i 
minute lor 10 cydes; at 98-C for 20 seconds and at 60-G for 3 minutes for 20 cycles; and at 72-G for 10 minutes After 
he PGR, the 20 v\ reaction solution was loaded onto a 1% agarose gel and fractionated by eledrophoresis DNA on 
the gel was stained with ethidium bromide to confirm the amplification of cDNA. When cDNAs were not amplified by 
PCR.plasmidscomainingthe corresponding insertcDNAswereprepared by thealte 

E.F., Fntsh. & T. I^aniatis. "Molecular Cloning. A laboratory manual/2nd ediUon. ColdSpring Harbor Laboratory Press' 
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1989). 

[01 33] Preparation of DNA array was carried out by the following procedure, A sample of a DNA solufion was added 
in each well of a 384-well plate. DNA was sponed onto a nylon niembrane (Boehringer) by using a 384-pin tool of 
Biomek 2000 Laboratory Automation System (Beckman-Coulter), Specifically, the 384-well plate containing the DNA 
was placed under the 384-pin tool. The independent 384 needles were simultaneously dipped into the DNA solution 
for DNA deposition. The needles were gently pressed onto a nylon membrane and the DNA deposited at the tips of 
needles was spotted onto the membrane. Denaturation of the spotted DNA and immobilization of the DNA on the nylon 
membrane were carried out according to standard methods (J. Sambrook. E.F.. Fritsh. & T Maniatis. 'Molecular Clon- 
ing. A laboratory manual/ 2nd edition, Cold Spring Harbor Laboratory Press. 1989). 

[0134] A probe for hybridization was radioisotope-labeled first strand cDNA. Synthesis of the first strand cDNA was 
performed by using TTienmoscripi^« RT-PCR System (GIBCO). Specifcally. the first strand cDNA was synthesized by 
using 1.5 jig of mRNAs from various human tissues (Clontech). 1 jil of 50 Oligo(cfT)20 and 50 ^ Ci [a 33pjdATP 
according to an attached protocol. Purification of a probe was canned out by using ProbeQuantT" G-50 mfcro column 
(Amersham-Pharmacia Biotech) according to an attached protocol. In the next step, 2 units of E. coli RNase H were 
added to the reaction mixture. The mixture was incubated at room temperature for 10 minutes, and then, 100 \ig of 
human COT-1 DNA (GIBCO) was added thereto. The mixture was incubated at sy-'C for 10 minutes and then was 
allowed to stand on Ice to give hybridization probe. 

[01 35] Hybridization of the radioisotope-labeled probe to the DNA array was performed according to standard meth- 
ods (J. Sambrook. E.F, Fritsh, & T Maniatis. Molecular Cloning, A laboratory manual/ 2nd edition, Cold Spring Harbor 
Laboratory Press. 1989). The membrane was washed as follows: the nylon membrane was washed 3 times by incu- 
bating it in Washing solution 1 (2 x SSC. 1 % SDS) at room temperature (about 26"C) for 20 minutes; then the membrane 
was washed 3 times by incubaUng it in W^hing solution 2 (0.1 x SSC. 1% SDS) at 65*C for 20 minutes. 
[0136] Autoradiography was perfomned by using an image plate for BAS2000 (Fuji Photo Film Co., Ltd.). Specifcally, 
the nylon membrane with probe hybridized thereon was wrapped with a piece of Saran Wrap and brought into tight 
contact with the image plate on the light-sensitive surface. The membrane with the image plate was placed in an 
imaging cassette for radioisotope and allowed to stand in dark place for 4 hours. The radioactivity recorded on the 
image plate was analyzed by using BAS2000 (Fuji Photo Film Co.. Ltd.). The activity was subjected to electrons 
conversion and recorded as an image file of autoradiogram. The signal intensity of each DNA spot was analyzed by 
using Visage High Density Grid Analysis Systems (Genomfc Solutions Inc.). The signal intensity was converted into 
numerical data. The data were taken in duplicate. Jhe reproducibility was assessed by comparing the signal intensities 
of the corresponding spots on the duplicated DNA filters that were hybridized to a single DNA probe (Figure 2). In 95% 
of entire spots, the ratio behween the con-esponding spots falls within a range of 2 or less, and the correlation coefficient 
is r=1 .97. TTius, the reproducibility is satisfactory. 

[0137] The detecUon sensitivity in gene expression analysis was estimated by examining increases in the signal 
intensity of probe concentration-dependent spot in hybridization using a probe complementary to the DNA spotted on 
the nylon membrane. DNA used was PUVCEI 008092 (the same as DNA deposited in GenBank under an Accession 
No. AF1 07253). The DNA array with DNA of PU\CE1 008092 was prepared according to the above-mentioned method. 
TTie probe used was prepared as follows: mRNA v^s synthesized in vitro from the clone. PLACE1 008092. By using 
this mRNA as a template, radioisotope-labeled first strand cDNA was synthesized in the same manner as descnDed 
above, and the cDNA was used as the probe. In order to synthesize mRNA from PLACE1008092 in vitro, a plasmid 
in which the 5' end of the cDNA PLACE 1008092 was ligated to the T7 promoter of pBluescript SK(-) was constructed. 
Specifk:ally, the PLACE1 008092 insert was cut out from pMEl 8SFL3 canying the cDNA at a Drain site thereof by Xhol 
digestion. The resulting PUVCE1 008092 fragment was ligated to Xhol-predigested pBluescript SK(-) by using DNA 
ligation kit ver.2 (Takara). The in vitro mRNA synthesis from PLjf\CE1 008092 inserted into pBluescript SK(-) was carried 
out by using Ampliscribe^M t7 high yield transcriptton kit (Epicentre technotogies). Hybridization and the analysis of 
signal intensity of each DNA spot were perfomied by the same methods as descn^bed above. When the probe con- 
centration is 1 X 107 ^g/ml or less, there was no increase of signal intensity proportiona! to the probe concentration. 
Therefore; it was assumed to be difficult to compare the signals with one another in this concentratton range. Thus, 
the spots with the intensity of 40 or less were unifonmly taken as low level signals (Rgure 3). Within a concentratiori 
of the probe ranging from 1 x 10^ ^gfm\ to 0.1 ^g^ml, the signal was found to increase in a probe concentration- 
dependent manner. TTie detection limit represented as the ratio of the expression level of test mRNA to that of total 
mRNA in a sample was 1 : 100, 000. 

[01 38] Tables 5-1 61 (also containing clones without description in Examples) show the expression of each cDNA in 
human normal tissues (heart, lung, pituitary gland, thymus, brain, kidney, liver and spleen). The expression levels are 
indcated with numerical values of 0-1 0.000. Genes that were expressed in at least a single tissue are indicated below 
by the corresponding clone names: 

clone; HEMBA1000446. HEMBA1000675. HEMBA1001322. HEM8A1001552. HEMBA1001680, HEMBA1001879 
HEMBA1002441. HEMBA1002706. HEMBA1002715, HEMBA1002913. HEMBA1002981 , HEMBAl00328o! 
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HEMBA1 003702, 

HEMBAl 005628, 

HEMBA1007186. 

NT2RP2000396, 

NT2RP2001142. 

NT2RP2002429, 

NT2RP2003115. 

NT2RP2004966, 

NT2RP3000234, 

NT2RP3001613. 

NT2RP3003549, 

PLACE1000456» 

PLACE1003405, 

PLACE1 005047. 

PLACE1010330. 



HEMBA1 003764, 

HEMBA1 005833. 

NT2RM 1000558, 

NT2RP2000428, 

NT2RP2001270, 

NT2RP2002695, 

NT2RP2003227, 

NT2RP2005219, 

NT2RP3000266, 

NT2RP3001861, 

NT2RP3003731, 

PLACE1001098, 

PLACE1003724, 

PLACE 1005760, 

PLACE1010482, 



HEMBA1004100. 
HEM8A1 006099. 
NT2RP1000125, 
NT2RP2000557. 
NT2RP2001341. 
NT2RP2002907. 
NT2RP2003902, 
NT2RP2005322. 
NT2RP3000326, 
NT2RP3003097. 
NT2RP3003789. 
PLACE1001300, 
PLACE1004113. 
PLACE 1006472, 
PLACE1011134. 



HEMBA1 004633, 
HEMBA1 006391. 
NT2RP1 000279, 
NT2RP2000601. 
NT2RP2001499. 
NT2RP2002927, 
NT2RP2004130, 
NT2RP2005671. 
NT2RP3000638. 
NT2RP3003235. 
NT2RP3004S41, 
PLACE1001904, 
PLACE1 004273, 
PLACEl 006610. 
PLACE1011146, 



HEMBA1 005096, 
HEMBA1 006813, 
NT2RP1 000837, 
NT2RP2000720. 
NT2RP2001508, 
NT2RP2002934, 
NT2RP2004755, 
NT2RP2005970. 
NT2RP3000719. 
NT2RP3003258. 
OVARC1 000636, 
PLACE1002376, 
PLACEl 004757, 
PLACEl 007635. 
PLACE1011360, 



HEMBA1 005452, 
HEM BA1 007 104. 
NT2RP1001023, 
NT2RP2001087, 
NT2RP2001768, 
NT2RP2003050, 
NT2RP2004795, 
NT2RP2006435, 
NT2RP3001359, 
NT2RP3003368. 
OVARC1001849, 
PLACEl 002379, 
PLACEl 004850. 
PLACEl 009580, 
PLACE1011386, 



PLACE1011514, PLACE1011835. 
[01391 Genes that were expressed in all the tissues tested are Indicated below by the corresponding clone names- 
clone: HEMBA1002715, NT2RP1001023. NT2RP2000396. NT2RP21 103902. NT2RP2005970, NT2RP301 13258 
NT2RP3003731 , PLACE1003405, PLACEl 003724,. 

[01 40] Genes that were expressed at low levels in any of the tissues tested are indicated below by the corresponding 
clone names: 

clone: HEMBA1000296, HEMBA1001490. HEMBA1004078, HEMBA1004149. HEMBA1005301 . HEMBA1005703, 
HEMBA1006019. HEMBA1006549, HEMBA1007053. NT2RM1000066, NT2RM1000566, NT2RM1000634' 



NT2RM1 000726, 
NT2RP1 000544, 
NT2RP1001042, 
NT2RP2002115. 
NT2RP2005670, 
NT2RP3002337. 
OVARC1001510. 
PLACEl 0041 70, 
PLACE1007190, 



NT2RM 1000853, 
NT2RP 1000567, 
NT2RP2000028. 
NT2RP2003471, 
NT2RP2006028, 
NT2RP3003536, 
OVARCl 001636, 
PLACE 1004322, 
PLACE 1007338, 



NT2RM1001103. 
NT2RP1 000593, 
NT2RP2000116. 
NT2RP2004036, 
NT2RP2006400, 
NT2RP3004059, 
PLACEl 001 022, 
PLACEl 004507, 
PLACE1007878. 



NT2RP1 000255. 
NT2RP1 000769, 
NT2RP2000168. 
NT2RP2004049, 
NT2RP2006476, 
NT2RP3004063, 
PLACEl 003085. 
PLACEl 004904. 
PLACEl 007885, 



IMT2RP1 000477. 
NT2RP1 000905, 
NT2RP2000279. 
NT2RP2004076, 
NT2RP3001619, 
OVARCl 000363. 
PLACEl 003378. 
PLACE 1006269, 
PLACEl 008738. 



NT2RP 1000533, 
NT2RP1 000921. 
NT2RP2000358, 
NT2RP2004974, 
NT2RP3001874, 
OVARC1001499, 
PLACE 1003549. 
nnnnnnnnnnnn, 
PLACEl 008994. 



PLACEl 009772, PLACEl 01 0021, PLACEl 01 0978. 
[01 41 1 Genes exhibiting characteristic features in the expression thereof were selected by statistical analysis of these 
data. Two examples are shown below to descrtoe the selection of genes of which expression is varied greatly among 
tissues. The p-aclin gene is used frequently as a control in gene expression analysis. Genes of which expression is 
varied greatly among tissues as compared that of the f^actin gene were detemiined as follows. Specifically, sum of 
squared deviation was calculated in the signal intensity of p-acUn observed in each tissue, which was divided by 7 
degrees of freedom to determine a variance S^z Next, sum of squared deviation was calculated in the signal intensity 
of a compared gene in each tissue, which was divided by 7 degrees of freedom to detemnine a variance By taking 
variance ratio F as f^St/^S^^, genes with a significance level of 5% or more were extracted in the F distribution. Genes 
extracted are indicated below by the conresponding clone names: NT2RP1001023(PSEC0045). 
[0142] Gene of OVARCl 000037 (heterogeneous nuclear ribonucleoprotein (hnRNP)) which expression is varied 
little. Genes of which expression is varied greatly among tissues as compared that of the OVARCl 000037 gene v^re 
detemiined as follows. Specifically, sum of squared deviation was calculated in the signal intensity of p-actin observed 
in each tissue, which was divided by 7 degrees of freedom to determine a variance S^^. Next, sum of squared deviation 
was calculated in the signal intensity of a gene to be compared observed in each tissue, which was divided by 7 degrees 
of freedom to detennine a variance S^^. By taking variance ratio F as f=%^/S^^. genes with a significance level of 5% 
or more were extracted in the F distribution. Genes extracted are indicated below by the con-esponding clone names- 
clone: NT2RP1 001 023 {PSEC0045), NT2RP2005970 (PSEC0084), 

[01 43] Thus, characteristic features in the expression of a gene are illustrated by comparing and statistically analyzing 
the expression of many genes. 

Analysis of genes associated with neural cell differentiation 

[0144] Genes involved in neural cell differentiation are useful for treating neurological diseases. It is possible that 
genes with varying expression levels in response to induction of cellular differentiation in neural cells are associated 
with neurological diseases. 
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[01 45] A survey was performed for genes of which expression levels are varied in response to induction of differen- 
tiation (stimulation by relinoic add (RA)) in cultured cells of a neural strain, NT2. 

[01 461 The NT2 cells were treated basically according to supplier's instmction manual. -Undifferentiated NT2 cells- 
means NT2 cells successively cultured in an Opti-MEM I (GIBCO-BRL; catalog No. 31985) containing lO%(vA/) fetal 
bovine serum and 1%(v^) penicillin-streptomycin (GIBCO BRL). -NT2 cells cultured in the presence of retinoic acid- 
means the cells resulted from transfenring undifferentiated NT2 cells into a retinoic acid-containing medium which 
consists of D-MEM (GIBCO BRL; catalog No. 11 965), 10%(v^) fetal bovine serum, 1%(v^) penicillin-streptomycin and 
10 >iM reiinoic actd (GIBCO-BRL). and the subsequent successive culture therein for 5 weeks. ■NT2 cells that were 
cultured in the presence of retinoic acid and then further cultured in the presence of cell-division inhibitor added" means 
NT2 cells resulted from transferring NT2 cells cultured in the presence of retinoic acid for 5 weeks into a cell-division 
mhibitor-<x)ntaining medium, whteh consisted of OMEM(GIBC0 BRL; catalog No.1 1 965). 1 0%(v/v) fetal bovine serum 
1%(v^) penicillin-streptomycin. 10 jiM retinoic acid, 10 ^iM FudR (5-fluoro-^Kleoxyuridine: GIBCO BRL), W^M Urd 
(Uridine: GIBCO BRL) and 1 araC (Cytosine p-D-Arabinofuranoside: GIBCO BRL). and the subsequence succes- 
sive culture for 2 weeks. Each of the cells were treated with trypsin and then harvested. Total RNAs were extracted 
from the cells by using S.N.A.P.(™) Total RNA Isolation kit (InvitrogeniO). jhe labeling of probe used for hybridization 
was earned out by using 10 ^g of the total RNA according to the same methods as described above. The data were 
obtained in triplicate {n-3). The data of signal value, representing gene expression level in the cells in the presence of 
stimulation for inducing differentiation were compared with those in the absence of the stimulation. The comparison 
was performed by statistical treatment of two-sample t-tesl. Clones with significant difference in the signal distribution 
were selected under the condition of p<0.05. In this analysis, clones with the difference can be statistk:ally detected 
even when the signals were low. Accordingly, clones with signaJ value of 40 or less were also assessed for the selection 
[0147] Tables 1 62-341 show the expression level of each cDNA in undifferentiated NT2 cells, NT2 cells cultured in 
the presence of RA. and NT2 cells that were cultured in the presence of RA and that were further cultured in the 
presence of cell-division inhibitor added, 

[0148] Averaged signal values (M,. Mg) and sample variances (s,2. ^Z) were calculated for each gene in each of 
the cells, and then, the pooled sample variances s^ were obtained from the sample variances of the two types of cells 
to be compared. The t values were determined according to the foltowing fonnula: t=(M1-M2)/s/(1/a+-1/3)i/2 When the 
deienmined t-value was greater than a t-value at P. which means the probability of signifcance level, of 0 05 or 0 01 
in the t-distnbution table with 4 degrees of freedom, the difference was judged to be found in the expression level of 
the gene between the two types of cells at p<0.05 or p<o.Ol . respectively- The tables also include the information on 
an increase (+) or decrease f ) in the expression level of a gene in the treated cells when the level is compared with 
that of untreated undifferentiated cells. 

[0149] Clones of which expression levels increased by RA are as follows: 

PSEC0017. PSEC0021. PSEC0041. PSEC0047. PSEC0049. PSEC0055. PSEC0066. PSEC0070 PSEC0071 
PSEC0072. PSEC0074. PSEC0075. PSEC0076, PSEC0080, PSEC0084. PSEC0088. PSECOO94' PSECOIO3' 
PSEC0105. PSEC0112. PSEC0113. PSEC0119. PSEC0127. PSEC0129, PSEC0139. PSEC0143* PSECOI44' 
PSEC0152. PSEC0171. PSEC0181, PSEC0182. PSEC0192. PSEC0195. PSEC0200 PSEC0203 PSECO215' 
PSEC0223. PSEC0235, PSEC0239. PSEC0243. PSEC0255, PSEC0265. 
[0150] Clones of whk;h expression levels increase by RA/inhibitorare as follows: 

PSEC0017. PSEC0019, PSEC0030. PSEC0041. PSEC0047, PSEC0048, PSEC0049, PSEC0059 PSEC0066 
PSEC0072. PSEC0081. PSEC0084. PSEC0094. PSEC0104, PSEC0117. PSEC0119, PSECOI20' PSECOI29' 
PSEC0136, PSEC0139. PSEC0143. PSEC0152. PSEC0161. PSEC0169. PSEC0181. PSECOI82' PSECOI92' 
PSEC0203. PSEC0223, PSEC0235. PSEC0251. PSEC0265. 

[0151] Clones of which expression levels increase in the presence of both RA and RA/inhibitor are as follows- 
PSEC0017. PSEC0041. PSEC0047. PSEC0049. PSECOO66. PSEC0072, PSEC0084 PSEC0094 PSEC0119 
PSEC0129. PSEC0139. PSEC0143. PSEC0152. PSECOI8I. PSEC0182, PSEC0192 PSECO203' PSEC0223* 
PSEC0235. PSEC0265. ' 
[01 52] These are neurological disease-associated clones. 
[01 53] Analysis of rheumatoid arthritis-associated genes 

[0154] The onset of rheumatoid arthritis is thought to be involved in the proliferation of synovial cells covering inner 
surfaces of joint cavity and in inflammatory reaction resulted from the action of cytokines produced by leukocytes 
infiltrating into the joint synovial tissues (Rheumatism Infomiation Center hltpV/www.rheu ma-neLor.jp/). Recent studies 
have also revealed that tissue necrosis factor (nsiF)-« participates in the onset (Current opinion in immunology 1 999 
1 1 . 657-662). When the expression of a gene exhibits responsiveness to the action of TNF on synovial cells, the gene 
IS considered to be involved in rtieumatoid arthritis. 

[0155] A survey was perfonmed for genes of which expression levels are varied in response to TNF-a in the primary 
cell culture of synovial tissue. The primary cultured cells of the smooth muscle (Cell Applications) were grown to be 
confluent in a culture dish, and then, human TNF-a (Boehringer-Mannheim) was added at a final concentration of 10 
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ng/ml itiereto. The culture was further continued for 24 hours. 

[0156] Total RNA was extracted from the cells by using S.N.A.R(^«) Total RNA Isolation kit (Invitrogen). TTie labeling 
of probe used for hybridization was carried out by using 10 ng of the total RNA according to the same methods as 
described above. The data were obtained in triplicates (n=3). TTie data of signal value representing gene expression 
level in the cells in the presence of TNF stimulation were compared with those in the absence of the stimulation The 
comparison was perfomied by stalisUcal treatment of two-sample t-test. Clones with significant difference in the signal 
drstribution were selected under the condition of p<0-05. In this analysis, clones with the difference can be statisticairy 
detected even when the signals were low. Accordingly, clones with signal value of 40 or less were also assessed for 
the selection. 

(01 57] Table 343 shows the expression level of each cDNA in synovial cells cultured in the absence or presence of 
TNF. 

[0158] Averaged signal values (M„ Mg) and sample variances (s,2. s^^) for each gene were calculated in each of 
the cells, and then, the pooled sanrtple variances s2 were obtained from the sample variances of the two types of cells 
to be compared. TDe l-values were determined according to the following formula: H^yU^Isim^mff^ When the 
determined t-value was greater than a t-value at P. which means the probability of significance level of 0 05 or 0 01 
fn the t-detnbution table with 4 degrees of freedom, the difference was judged to be found in the expression level of 
the gene between the two types of cells at p<0.05 or p<0.0l . respectively. The tables also include the information of 
an increase (+) or decrease (-) in the expression level of a gene in the stimulated cells when the level is compared 
with that of unstimulated cells. 

[0159] PSEC clones of which expression levels are elevated by TNF-a are as follows* ' 

PSECOOyo. PSEC0073. PSEC0084. PSEC0100. PSEC0109. PSEC0120. PSEC0131. PSEC0161 PSEC0183 
PSEC0192. PSEC0197, PSEC0205. PSEC0207. PSEC0210. PSEC0213. PSEC0222. PSEC023o' PSEC024l' 
PSEC0252. PSEC0259. ' ' 

[0160] PSEC clones of which expression levels decrease by TNF-a are as follows- 
PSEC0105, PSEC0245. 

[0161] These are rheumatoid arthritis-associated clones. 

Table 5 

Expression of each cDNA in human tissues (containing clones that are not described in 
Examples.) 
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Claims 

1 . An isolated potynucleotide selected from the group consisting of 

(a) a polynucleotide comprising a coding region of the nucleotide sequence set forth in any one of the followina 
SEQ ID NOs: SEQ ID NO: 1 . 3. - • - 347. and, 349; 

(b) a polynucleotide comprising a nucleotide sequence encoding a protein comprising the amino acid sequence 
set forth in any one of the following SEQ ID NOs: SEQ ID NO: 2. 4. ... 348, and, 350; 

(c) a polynucleotide comprising a nucleotide sequence encoding a protein comprising an amino acid sequence 
selected from the amino acid sequences of (b). in which one or more amino acids are substituted, deleted 
inserted, and^or added, wherein said protein is functionally equivalent to the protein comprising said amino 
acid sequence selected from the amino acid sequences of (b); 

(d) a polynucleotide that hybridizes with a polynucleotide comprising a nucleotide sequence selected from the 
nucieoUde sequences of (a), and that comprises a nucleotide sequence encoding a protein functionally equiv- 
alent to the protein encoded by the nucleotide sequence selected from the nucleotide sequences of (a); 

(e) a polynucleotide comprising a nucleotide sequence encoding a partial amino acid sequence of a protein 
encoded by the polynucleotide of (a) to - (d); 

(0 a polynucleotide comprising a nucleotide sequence with at least 70% identity to the nucleotide sequence 
of (a). 

2. A substantially pure protein encoded by the polynucleotide of claim 1 . 

3. Use of an oligonucleotide as a primer for synthesizing the polynucleotide comprising the nucleotide sequence set 
forth in any one of SEQ ID NOs: 370-540 or the complementary strand thereof, wherein said oligonucleotide is 
complementary to said polynucleotide or the complementary strand thereof and comprises at least 1 5 nucleotides. 

4. A primer set for synthesizing polynucleotides, the primer set comprising an oligcKfT primer and an oligonucleotide 
complementary to the complementary strand of the polynucleotide comprising the nucleotide sequence set forth 
in any one of SEQ !D NOs: 370-540, wherein said oligonucleotide conrrprises at least 1 5 nucleotides. 

5. A primer set for synthesizing polynucleotides, the primer set comprising a combination of an oligonucleotide com- 
prising a nucleotide sequence complementary to the complementary strand of the polynucleotide comprising a 5*- 
end nucleotide sequence and an oligonucleotide comprising a nucleotide sequence complementary to the poly- 
nucleotide comprising a 3'-end nucleotide sequence, wherein said oligonucleotides comprise at least 15 nucle- 
otides and wherein said combination of 5'-end nucleotide sequenceA3'-end nucleotide sequence is selected from 
the group consisting of: SEQ ID NO: 391/SEQ ID NO: 541 , - - - and SEQ ID NO: 540/SEQ ID NO: 679 

6. A polynucleotide which can be synthesized with the primer set of claim 4 or 5. 

7. A polynucleotide comprising a coding region in the polynucleotide of daim 5. 

8. A substantially pure protein encoded by polynucleotide of claim 7. 

9. A partial peptide of the protein of claim 8. 

10. An antibody against the protein or peptide of any one of claims 2. 8, and 9. 

1 1 . A vector comprising the polynucleotide of claim 1 or 7. 

12. A iransfomrtant canying the polynucleotide of claim 1 or 7. or the vector of claim 11 . 

13. A transformant expressively carrying the polynucleotide of claim 1 or 7, or the vector of claim 11 . 

1 4. A method for producing the protein or peptide of any one of claims 2. 8. and 9. comprising culturing the transformant 
of claim 13 and recovering the expression product. 

15. An oligonucleotide comprising the nucleotide sequence of daim 1 (a) or the nucleotide sequence complementary 
to the complementary strand thereof, wherein said oligonucleotide comprises 15 nucleotides or more. 
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16. Use of the oligonucleotide of claim 1 5 as a primer for synthesizing a polynucleotide. 

17. Use of the oligonucleotide of claim 15 as a probe for detecUng a gene. 

18. An antisense polynucleotide against the polynucleotide of claim 1 . or the portion thereof. 

19. A method for synthesizing a polynucleotide, the method comprising: 

• b) recovering the synthesized product. 

20. TTie method of claim 1 9. wherein the cDNA library is obtainable by oligo-capping method. 

21. The method of claim 19. wherein the complementary strand is obtainable by PGR. 

22. A method for detecting the fK)lynucleotide of claim 1 . the method comprising: 

a) incubating a target polynucleotide with the oligonucleotide of claim 15 under the conditions where hybridi- 
zation occurs, and ' 

b) detecting the hybridization of the target polynucleotide with the oligonucleoHde of claim 1 5. 

tSfri^fr! Of polynucleotides and/or proteins, the database comprising information on at least one sequence 
selected f^om the nucleotde sequences of claim 1 (a) and^or the amino acid sequences of claim 1 (b). or a medium 
on which the database is stored. cuium 
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Figure 2 
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Figure 3 
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Figure 4 
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Figures 
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Figure 6 
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